THE RELATIONSHIP BETWEEN IQ AND SIZE OF FAMILY


H. E. G. SUTHERLAND 


Department of Education, Edinburgh University 


INTRODUCTION 


Following the recent article by Theodore Lentz, Jr.¹ on the relation
of IQ to size of family, printed in this journal, the time may be
opportune to furnish some additional data on the problem of the apparently
higher rate of increase of the less intelligent members of modern 
communities in comparison with the more intelligent. 

A number of investigations have been made in this field. Duff and 
Thomson found the correlation between the parents’ occupation and 
the intelligence of the children to be approaching 0.30. Investigations 
by Macdonald, by Sandiford, and by Haggerty and Nash have been 
in agreement with this.²

Bradford, comparing the British census returns for 1921 with the 
average intelligence of the occupational groups as found in these 
investigations, shows that an increase of the mean intelligence of the 
children from group to group corresponds with a decrease in the 
average number of children per family among those groups. He also 
gives a correlation between size of family and IQ of −.25 ± .03 for some
four hundred city children.³





1 Lentz, Theodore, Jr.: Journal of Educational Psychology, Vol. XVIII, October, 
1927, p. 486. 

2 Duff, J. F. and Thomson, Godfrey H.: British Journal of Psychology, Vol. 
XIV, 1923, p. 192; Macdonald, ibid., Vol. XVI, 1925, p. 123; Sandiford, School
and Society, Vol. XXIII, 1926, p. 117; Haggerty and Nash, Journal of Educational
Psychology, Vol. XV, 1924, p. 559. 

3 Bradford: Can Present Scholastic Standards be Maintained? Forum of 
Education, Vol. III, 1925, p. 186. 
Terman selected ninety-two families which could be reckoned as
completed from five hundred seventy-eight families yielding gifted
subjects and, within this highly selected group, found the correlation
between size of family and IQ to be −.271 ± .062 or, with schooling
of mid-parent constant, −.246 ± .063.¹

Chapman and Wiggins,² with six hundred fifty cases, give correlations
between family size, intelligence of offspring, and socio-economic
status of the family, the correlation between size of family
and IQ being −.33 ± .024.

Pearson and Moul,³ however, with twelve hundred subjects from
the Jews Free School, Aldgate, London, find no significant correlation
between size of family and IQ.

Sutherland and Thomson,⁴ with eight hundred forty and ten
hundred eighty-four elementary school children from the Isle of Wight,
obtained correlations between size of family and IQ of −.15 ± .023
and −.218 ± .019.


FATHERS OF SUBJECTS BELONG TO ONE OCCUPATIONAL GROUP


The need has been indicated for further studies on this subject and 
the purpose of the present paper is to furnish data relative to children 
whose fathers all belong to one occupational group, namely miners. 
This group would come under the fourth category of the Taussig Scale 
(semi-skilled labor) and would be given a rating on the Barr Scale of 
Occupational Status of 4.29. The total number of cases is thirty 
hundred ninety-six. Miners form a well defined group, and the 
fathers here considered all belonged to the rank and file of the occupa- 
tion, everyone who held any kind of distinguishing position whatsoever 
being excluded. Miners in Britain live in villages where they form 
the great majority of the inhabitants, frequently in houses or cottages 
supplied by the owners of the mine, and their wage is strictly regulated 
by Trade Union agreements. They form as homogeneous a group for 
social conditions as is likely to be found anywhere. 

This group is thus of special interest. It will be seen that here, 
also, where the factor of environment is almost constant, there is a 





1 Terman, L. M.: “Genetic Studies of Genius,” Vol. I, 1925.

2 Chapman, J. C. and Wiggins, D. M.: Pedagogical Seminary and Journal of
Genetic Psychology, 1925, p. 414.

3 Pearson, Karl and Moul, Margaret: The Annals of Eugenics, Vol. I, 1925.

4 Sutherland, H. E. G. and Thomson, Godfrey H.: British Journal of Psychol- 
ogy, Vol. XVII, 1926, p. 81. 
quite distinct tendency for the children with higher intelligence to
come from the smaller families. 


THE INTELLIGENCE TEST 


The test given was one of the Moray House Group Tests stand- 
ardised by Professor Godfrey Thomson and his students and assistants. 
It was used on sixty thousand school children between the ages of 
eleven and thirteen, in the West Riding of Yorkshire, in 1926, of which 
number the thirty hundred ninety-six cases here dealt with form a
part. 


TABLE I. RANDOM SAMPLE

IQ CATEGORIES        NUMBER OF CASES
120+                       28
115 to 120                 38
110 to 115                 53
105 to 110                 93
100 to 105                121
 95 to 100                 73
 90 to 95                  48
 85 to 90                  54
 80 to 85                  43
Below 80                   30

Average IQ = 99.9    Total.... 581


To indicate the validity of the test, a random sample of five hundred 
eighty-one cases was taken from the varied occupational groups repre- 
sented, and it was found that these cases gave a distribution as shown, 
while the average IQ was found to be 99.9. Table I shows the IQ 
distribution of this random sample. 

Next, the correlation between age and IQ was worked out for 
eleven hundred forty-nine cases of the occupational group we are
considering, and r was found to be zero (−.039 ± .019). Thus the
intelligence quotient as determined by this test was shown to be 
uncorrelated with age (Table II). 


TREATMENT OF DATA 


The thirty hundred ninety-six cases dealt with in this paper are
made up of two samples, α and β, of eleven hundred six and nineteen
hundred ninety cases respectively. Tables III and IV show the
results found for the first sample (α). It will be seen that the mean
IQ for each group according to size of family shows a slight but
noticeable tendency to decrease with the increase in the number of
children, the decrease being from 94.9 to 88.8. The average IQ for the
group is low, being 91.6. A slight tendency is also observable for the
average size of family to increase as we pass from the higher to the
lower IQ levels. This increase is from 4.4 to 5.2. The average family
for the whole group is fairly large, being 4.9.


TABLE III. GROUP α

CHILDREN IN THE FAMILY    AVERAGE IQ    NUMBER OF CASES
          1                  92.9              44
          2                  94.9             126
          3                  92.6             172
          4                  90.7             189
          5                  92.4             172
          6                  90.7             140
          7                  90.3             116
          8                  88.7              56
          9                  88.8              52
         10                  89.3              21
         11                  90.2              10
         12                  91.7               6
         13                  80.0               2
(Combining 10 and upwards = 89.4)

Mean IQ = 91.6                       Total.... 1106


TABLE IV. GROUP α

                  AVERAGE NUMBER OF
IQ                CHILDREN IN THE FAMILY    NUMBER OF CASES
120+                      4.5                      13
115 to 120                4.4                      21
110 to 115                4.8                      40
105 to 110                4.6                      80
100 to 105                4.6                     133
 95 to 100                4.7                     150
 90 to 95                 4.8                     163
 85 to 90                 5.2                     165
 80 to 85                 5.2                     169
Below 80                  5.2                     172

Mean size of family = 4.89               Total.... 1106
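The group figures quoted for sample α can be checked from the rows of Table III. The short sketch below recomputes the overall mean IQ and the combined mean for families of ten and upwards as case-weighted averages of the printed row means. The names `TABLE_III` and `weighted_mean` are illustrative only, and because the row means are already rounded to one decimal, the result can differ from the printed figure in the last place.

```python
# (children in family, average IQ, number of cases) as printed in Table III
TABLE_III = [
    (1, 92.9, 44), (2, 94.9, 126), (3, 92.6, 172), (4, 90.7, 189),
    (5, 92.4, 172), (6, 90.7, 140), (7, 90.3, 116), (8, 88.7, 56),
    (9, 88.8, 52), (10, 89.3, 21), (11, 90.2, 10), (12, 91.7, 6),
    (13, 80.0, 2),
]


def weighted_mean(rows):
    """Mean IQ over rows, weighting each row mean by its number of cases."""
    total_cases = sum(n for _, _, n in rows)
    return sum(iq * n for _, iq, n in rows) / total_cases


total_cases = sum(n for _, _, n in TABLE_III)
overall_mean = weighted_mean(TABLE_III)
ten_and_upwards = weighted_mean([row for row in TABLE_III if row[0] >= 10])

print(total_cases, round(overall_mean, 1), round(ten_and_upwards, 1))
```

The weighted figures agree with the printed means to within the rounding of the row entries.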

In the case of the second sample (β) girls and boys were taken
separately in order to find out primarily if the correlations between 
IQ and size of family showed any marked difference. Tables V and VI 
show that for the girls the average IQ decreases from 93.4 to 86.1 with 
increase in the size of family, the mean IQ for the whole group of girls 
being 91.2. The average size of family increases from 3.9 at 120+ to 


TABLE V. GROUP β, GIRLS

CHILDREN IN THE FAMILY    AVERAGE IQ    NUMBER OF CASES
          1                  93.4              37
          2                  93.2             113
          3                  93.2             144
          4                  92.5             138
          5                  91.6             129
          6                  88.9             132
          7                  89.0              79
          8                  92.3              67
          9                  86.1              39
         10                  84.2              24
         11                  88.4               8
         12                  94.5               6
         13                  89.5               2
         14                   ..               ..
         15                   ..               ..
         16                   ..               ..
         17                  98                 1
(Combining 10 and upwards = 87.1)

Mean IQ = 91.2                       Total.... 919


TABLE VI. GROUP β, GIRLS

                  AVERAGE NUMBER OF
IQ                CHILDREN IN THE FAMILY    NUMBER OF CASES
120+                      3.9                      13
115 to 120                3.4                      17
110 to 115                4.7                      37
105 to 110                4.5                      64
100 to 105                4.6                     119
 95 to 100                4.9                     112
 90 to 95                 5.1                     111
 85 to 90                 5.0                     152
 80 to 85                 5.2                     129
Below 80                  5.4                     165

Mean size of family = 4.97               Total.... 919


5.4 below 80, the mean family size for the whole group being 4.9. The
number of cases is nine hundred nineteen.

Tables VII and VIII show the results for the boys of the second
sample (β). The average IQ declines from 96.7 to 88.2 as the number
in the family increases from group to group, the mean IQ for the whole 
group of boys being 90.6. As we pass from the higher IQ categories to 
the lower the average size of family increases from 4.0 to 5.2. The 
mean size of family for the whole of this group is 4.9 again, and the 
number of cases is ten hundred seventy-one. 
TABLE VII. GROUP β, BOYS

CHILDREN IN THE FAMILY    AVERAGE IQ    NUMBER OF CASES
          1                  96.7              36
          2                  92.2             117
          3                  91.5             183
          4                  90.9             192
          5                  90.1             163
          6                  89.5             114
          7                  89.3             109
          8                  89.1              75
          9                  88.2              39
         10               [illegible]          24
         11                  88.1              14
         12                  97.0               2
         13                 107.5               2
         14                  89                 1
(Combining 10 and upwards: value illegible)

Mean IQ = 90.6                       Total.... 1071


TABLE VIII. GROUP β, BOYS

                  AVERAGE NUMBER OF
IQ                CHILDREN IN THE FAMILY    NUMBER OF CASES
120+                      4.1                      10
115 to 120                4.0                      21
110 to 115                4.1                      33
105 to 110                4.7                      84
100 to 105                4.6                     117
 95 to 100                4.8                     125
 90 to 95                 4.8                     144
 85 to 90                 5.0                     159
 80 to 85                 5.1                     173
Below 80                  5.2                     205

Mean size of family = 4.90               Total.... 1071


In each of these tables reversals occur, but the trend indicated is 
quite apparent. Tables III to VIII present a striking uniformity as 
regards average IQ, average size of family, and the range of average 
IQ values given. In this homogeneous group of thirty hundred 
ninety-six cases there is still a tendency for the brighter children to 
come from the smaller families, though this is doubtless less marked 
than would be the case in a group of children unselected for social 
status or for intelligence. 


CORRELATIONS OBTAINED 


Tables IX and X show the correlation between IQ and size of family
for the groups α and β. The coefficients are low but in the expected
TABLE IX. GROUP α

                               Size of family
IQ           1    2    3    4    5    6    7    8    9   10   11   12   13   Totals
120+        ..    1    4    2    2    2    2   ..   ..   ..   ..   ..   ..      13
115-120      2    6   ..    1    5    5   ..   ..    1    1   ..   ..   ..      21
110-115      2    4   12    7    6    5    3    1   ..   ..   ..   ..   ..      40
105-110      1   12   19   14   14    5    4    4    5    1    1   ..   ..      80
100-105      9   21   17   18   28   10   18    3    5    1    1    2   ..     133
 95-100      9   25   18   25   20   19   19    4    5    4    2   ..   ..     150
 90-95       7   20   25   34   17   22   13   12    8    2    1    2   ..     163
 85-90       1   12   31   29   22   24   18   14    7    5    2   ..   ..     165
 80-85       7   12   22   28   32   27   17    8    8    3    2    2    1     169
Below 80     6   13   24   31   26   21   22   10   13    4    1   ..    1     172
Totals      44  126  172  189  172  140  116   56   52   21   10    6    2    1106

r = −.129 ± .019


TABLE X. GROUP β

                                         Size of family
IQ           1    2    3    4    5    6    7    8    9   10   11   12   13   14   15   16   17   Totals
120+        ..    4    8    4    2    3   ..    2   ..   ..   ..   ..   ..   ..   ..   ..   ..      23
115-120      5    5    8    8    6    2    2    2   ..   ..   ..   ..   ..   ..   ..   ..   ..      38
110-115      1   15   11   12   14    7    2    3    3    2   ..   ..   ..   ..   ..   ..   ..      70
105-110     10   15   33   27   18   12   11   11    6    1    1    1    2   ..   ..   ..   ..     148
100-105     16   32   38   36   32   28   26   17    5    3    1    2   ..   ..   ..   ..   ..     236
 95-100      9   30   40   42   31   26   24   17    7    5    3    1    1   ..   ..   ..    1     237
 90-95       5   24   43   53   40   33   22   19    5    4    6    1   ..   ..   ..   ..   ..     255
 85-90      13   40   44   41   57   37   26   25   13    9    2    3   ..    1   ..   ..   ..     311
 80-85      10   32   44   52   35   42   36   20   18    7    5   ..    1   ..   ..   ..   ..     302
Below 80     4   33   58   55   57   56   39   26   21   17    4   ..   ..   ..   ..   ..   ..     370
Totals      73  230  327  330  292  246  188  142   78   48   22    8    4    1    0    0    1    1990

r = −.126 ± .015


direction, showing again that the brighter children, in this particular 
occupational group, tend to come from the smaller families. 
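The coefficient for group α can be recomputed from the cell counts of Table IX. The sketch below is a hedged illustration and not the author's worked calculation: it assumes class midpoints for the IQ categories (treating the open-ended classes as 120 to 125 and 75 to 80), takes the three sparsest rows as inferred from the marginal totals, and uses the classical probable-error formula 0.6745(1 − r²)/√N. The function names are illustrative.

```python
from math import sqrt

# Assumed midpoints of the IQ classes, from 120+ down to below 80.
IQ_MIDPOINTS = [122.5, 117.5, 112.5, 107.5, 102.5, 97.5, 92.5, 87.5, 82.5, 77.5]
FAMILY_SIZES = list(range(1, 14))

# Cell counts of Table IX (group alpha); rows follow IQ_MIDPOINTS,
# columns follow FAMILY_SIZES. Empty cells are entered as 0.
COUNTS = [
    [0, 1, 4, 2, 2, 2, 2, 0, 0, 0, 0, 0, 0],
    [2, 6, 0, 1, 5, 5, 0, 0, 1, 1, 0, 0, 0],
    [2, 4, 12, 7, 6, 5, 3, 1, 0, 0, 0, 0, 0],
    [1, 12, 19, 14, 14, 5, 4, 4, 5, 1, 1, 0, 0],
    [9, 21, 17, 18, 28, 10, 18, 3, 5, 1, 1, 2, 0],
    [9, 25, 18, 25, 20, 19, 19, 4, 5, 4, 2, 0, 0],
    [7, 20, 25, 34, 17, 22, 13, 12, 8, 2, 1, 2, 0],
    [1, 12, 31, 29, 22, 24, 18, 14, 7, 5, 2, 0, 0],
    [7, 12, 22, 28, 32, 27, 17, 8, 8, 3, 2, 2, 1],
    [6, 13, 24, 31, 26, 21, 22, 10, 13, 4, 1, 0, 1],
]


def grouped_pearson(counts, row_values, col_values):
    """Product-moment r for a two-way frequency table, plus the total N."""
    n = sx = sy = sxx = syy = sxy = 0
    for row, y in zip(counts, row_values):
        for c, x in zip(row, col_values):
            n += c
            sx += c * x
            sy += c * y
            sxx += c * x * x
            syy += c * y * y
            sxy += c * x * y
    mean_x, mean_y = sx / n, sy / n
    cov = sxy / n - mean_x * mean_y
    var_x = sxx / n - mean_x ** 2
    var_y = syy / n - mean_y ** 2
    return cov / sqrt(var_x * var_y), n


def probable_error(r, n):
    """Classical probable error of a correlation coefficient."""
    return 0.6745 * (1 - r * r) / sqrt(n)


r, n = grouped_pearson(COUNTS, IQ_MIDPOINTS, FAMILY_SIZES)
print(f"N = {n}, r = {r:.3f}, PE = {probable_error(r, n):.3f}")
```

Under these assumptions the coefficient comes out very close to the printed value for group α, and the probable error is of the same order as the one reported.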

The negative correlations for these two samples corroborate one 
another in a very marked degree. The differentiation in regard to 
home environment in this occupational group is, I consider, as low as 
we may expect to find it, so that there is a strong presumption in favor 
TABLE XI. GROUP β (GIRLS ONLY)

                                         Size of family
IQ           1    2    3    4    5    6    7    8    9   10   11   12   13   14   15   16   17   Totals
120+        ..    3    4    2    1    2   ..    1   ..   ..   ..   ..   ..   ..   ..   ..   ..      13
115-120      3    2    3    6    1    1    1   ..   ..   ..   ..   ..   ..   ..   ..   ..   ..      17
110-115      1    8    4    5    7    5    1    3    2    1   ..   ..   ..   ..   ..   ..   ..      37
105-110      4    7   16    9    8    7    4    6    2   ..    1   ..   ..   ..   ..   ..   ..      64
100-105      5   19   23   17   16   12   11   10    2    2   ..    2   ..   ..   ..   ..   ..     119
 95-100      5   13   20   17   18   13    9    8    3    2    1    1    1   ..   ..   ..    1     112
 90-95       1   10   19   22   14   16   11   11    2    2    2    1   ..   ..   ..   ..   ..     111
 85-90       7   22   14   21   30   21   14   11    6    3    1    2   ..   ..   ..   ..   ..     152
 80-85       8   13   14   23   13   23    9   10   12    2    1   ..    1   ..   ..   ..   ..     129
Below 80     3   16   27   16   21   32   19    7   10   12    2   ..   ..   ..   ..   ..   ..     165
Totals      37  113  144  138  129  132   79   67   39   24    8    6    2    0    0    0    1     919

r = −.139 ± .022


TABLE XII. GROUP β (BOYS ONLY)

                                  Size of family
IQ           1    2    3    4    5    6    7    8    9   10   11   12   13   14   Totals
120+        ..    1    4    2    1    1   ..    1   ..   ..   ..   ..   ..   ..      10
115-120      2    3    5    2    5    1    1    2   ..   ..   ..   ..   ..   ..      21
110-115     ..    7    7    7    7    2    1   ..    1    1   ..   ..   ..   ..      33
105-110      6    8   17   18   10    5    7    5    4    1   ..    1    2   ..      84
100-105     11   13   15   19   16   16   15    7    3    1    1   ..   ..   ..     117
 95-100      4   17   20   25   13   13   15    9    4    3    2   ..   ..   ..     125
 90-95       4   14   24   31   26   17   11    8    3    2    4   ..   ..   ..     144
 85-90       6   18   30   20   27   16   12   14    7    6    1    1   ..    1     159
 80-85       2   19   30   29   22   19   27   10    6    5    4   ..   ..   ..     173
Below 80     1   17   31   39   36   24   20   19   11    5    2   ..   ..   ..     205
Totals      36  117  183  192  163  114  109   75   39   24   14    2    2    1    1071

r = −.115 ± .020


of the view that these coefficients represent a tendency in regard to the 
distribution of intelligence which is chiefly attributable to the child’s 
inheritance from the parents, and not to the effect of varying degrees 
of contact with environmental influences more or less educative. 

An attempt was made to find out whether girls show a greater 
tendency to rise superior to their environment than boys. In the 
second sample (β) correlation tables were therefore worked out for
girls and boys separately showing the relationship between IQ and size 
of family. The results are shown in Tables XI and XII. It will be 
seen that the negative correlation is very slightly higher for the girls’ 
group than for the boys’ group, so that the effect of heredity may 
possibly come out somewhat more strongly in the case of the girls. 


CONCLUSION 


While previous investigations have in the main found negative 
correlations between intelligence and size of family with groups of 
subjects not homogeneous as regards social status and occupation of 
the parent, this paper would draw attention to the existence of the 
same phenomenon, though in a less marked degree, among a group of 
children whose fathers all follow one occupation and who are all of the 
same social status. 








ON TETRAD DIFFERENCES WITH OVERLAPPING 
VARIABLES 


KARL J. HOLZINGER 


University of Chicago 


The present paper was suggested by the remark of a friend: that 
Professor Kelley’s new book “Crossroads in the Mind of Man” had
overthrown Spearman’s Two Factor Theory. Professor Kelley of 
course makes no such sweeping statement, but the perusal of his 
book might lead one to think that the Spearman theory was a sort of 
doubtful by-product of a more general theory. It is the opinion of the 
writer that none of the evidence in Professor Kelley’s book discredits 
the Spearman theory in the least and that his best data amply support 
it. This paper will then be concerned with a reconsideration of certain 
of Professor Kelley’s data, showing that wherever it is legitimate to 
apply the Spearman tetrad test, the agreement with the theory is as 
good as, if not better, than that found with Professor Spearman’s own 
data. 

For a seventh grade “population” (the writer prefers the word
“sample”) of one hundred forty children, Professor Kelley has obtained
intercorrelations of the following eight variables: 


1 = Reading speed
2 = Reading power
3 = Arithmetic speed
4 = Arithmetic power
5 = Memory for words
6 = Memory for numbers
7 = Memory for meaningful symbols
8 = Memory for meaningless symbols


A ninth variable, “space power,” has been omitted here merely to
lighten the long task of calculation.

The intercorrelations of these eight variables have been taken 
from Professor Kelley’s Table X and are reproduced here for 
convenience. 


$r_{12} = .6328$     $r_{23} = -.0553$    $r_{35} = .0420$    $r_{48} = .2843$
$r_{13} = .2412$     $r_{24} = .0655$     $r_{36} = .2270$    $r_{56} = .7169$
$r_{14} = .0586$     $r_{25} = .3097$     $r_{37} = .0215$    $r_{57} = .6693$
$r_{15} = .1950$     $r_{26} = .2988$     $r_{38} = .0573$    $r_{58} = .4662$
$r_{16} = .2318$     $r_{27} = .332[?]$   $r_{45} = .1487$    $r_{67} = .6319$
$r_{17} = .2969$     $r_{28} = .2501$     $r_{46} = .2893$    $r_{68} = .5050$
$r_{18} = .1999$     $r_{34} = .4248$     $r_{47} = .2489$    $r_{78} = .6915$
The tetrad differences necessary for testing Professor Spearman’s
Two Factor Theory are defined as follows:

$$t_{abcd} = r_{ab} r_{cd} - r_{ac} r_{bd}.$$

Thus $t_{1234} = r_{12} r_{34} - r_{13} r_{24}$,
$t_{1243} = r_{12} r_{34} - r_{14} r_{23}$, and
$t_{1342} = r_{13} r_{24} - r_{14} r_{23}$.


Since there are three such tetrad differences for every four variables, 
and four variables may be chosen from eight in $\binom{8}{4} = 70$ ways, there
will be two hundred ten tetrad differences in all. These have been 
calculated by the writer and are set forth in Table I. 

Let us now assume a general factor a running through all of the 
eight variables, and certain other general and specific factors as 
indicated by the following scheme: 


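$$
\begin{aligned}
x_1 &= c_1 a + k_1 b + j_1 d + s_1 \\
x_2 &= c_2 a + k_2 b + s_2 \\
x_3 &= c_3 a + k_3 c + j_3 d + s_3 \\
x_4 &= c_4 a + k_4 c + s_4 \\
x_5 &= c_5 a + k_5 e + s_5 \\
x_6 &= c_6 a + k_6 e + s_6 \\
x_7 &= c_7 a + s_7 \\
x_8 &= c_8 a + s_8
\end{aligned}
\tag{1}
$$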





The c’s, k’s, and j’s are constants, while a, b, c, d, e, and s’s are 
uncorrelated factors. It will be noted that a special reading factor b, 
arithmetic factor c, speed factor d, and memory (or other) factor e 
have been assumed. 

This simple set-up of the variables was suggested partly by inspection
of the eight variables and partly by an examination of the tetrad
differences presented in Table I. It would seem reasonable to postulate
a memory factor running through variables $x_5$, $x_6$, $x_7$, and $x_8$, but
analysis of the tetrad differences makes any special memory factor
between $x_7$ and $x_8$ unlikely. If such a factor were present then the
tetrad differences for such quadruplets as $x_1, x_4, x_7, x_8$; $x_1, x_5, x_7, x_8$;
and $x_1, x_6, x_7, x_8$ should not all vanish. From Table I, however, it is
apparent that the largest of these twelve tetrad differences is −.043,
which is insignificant. The four variables $x_5, x_6, x_7, x_8$ furnish tetrad
differences of .15, .20, and .04 which indicate a factor common to $x_5$
and $x_6$, but not common to $x_7$ and $x_8$ in view of the former evidence.
The factor e, therefore, may not be memory at all.

All that is claimed for the system (1) is that it is a reasonable one 
and, as will be shown later, yields tetrad differences in good accord 
with those obtained from the data. Professor Kelley’s system of 








TABLE I. TETRAD PRODUCTS AND DIFFERENCES

Variables
a, b, c, d   r_ab r_cd   r_ac r_bd   r_ad r_bc    t_abcd    t_abdc

1234           .269        .016       −.003        .253      .272
1235           .027        .075       −.011       −.048      .038
1236           .144        .072       −.013        .072      .157
1237           .014        .080       −.016       −.066      .030
1238           .036        .060       −.011       −.024      .047
1245           .094        .018        .013        .076      .081
1246           .183        .018        .015        .165      .168
1247           .158        .019        .019        .139      .139
1248           .180        .015        .013        .165      .167
1256           .454        .058        .072        .396      .382
1257           .424        .065        .092        .359      .332
1258           .295        .049        .062        .246      .233
1267           .400        .077        .089        .323      .311
1268           .320        .058        .060        .262      .260
1278           .438        .074        .066        .364      .372
1345           .036        .002        .083        .034     −.047
1346           .070        .013        .098        .057     −.028
1347           .060        .001        .126        .059     −.066
1348           .069        .003        .085        .066     −.016
1356           .173        .044        .010        .129      .163
1357           .161        .004        .012        .157      .149
1358           .112        .011        .008        .101      .104
1367           .152        .005        .067        .147      .085
1368           .122        .013        .045        .109      .077
1378           .167        .017        .004        .150      .163
1456           .042        .056        .034       −.014      .008
1457           .039        .049        .044       −.010     −.005
1458           .027        .055        .030       −.028     −.003
1467           .037        .058        .086       −.021     −.049
1468           .030        .066        .058       −.036     −.028
1478           .041        .084        .050       −.043     −.009
1567           .123        .155        .213       −.032     −.090
1568           .098        .108        .143       −.010     −.045
1578           .135        .138        .134       −.003      .001
1678           .160        .150        .126        .010      .034
2345          −.008        .003        .132       −.011     −.140
2346          −.016        .015        .127       −.031     −.143
2347          −.014        .001        .141       −.015     −.155
2348          −.016        .004        .106       −.020     −.122
2356          −.040        .070        .013       −.110     −.053
2357          −.037        .007        .014       −.044     −.051
2358          −.026        .018        .011       −.044     −.037
2367          −.035        .006        .075       −.041     −.110
2368          −.028        .017        .057       −.045     −.085
2378          −.038        .019        .005       −.057       ..
2456           .047        .090        .044       −.043       ..
2457           .044        .077        .049       −.033       ..
2458           .031        .088        .037       −.057       ..
2467           .041        .074        .096       −.033       ..
2468           .033        .085        .072       −.052       ..
2478           .045        .094        .062       −.049       ..
2567           .196        .200        .238       −.004       ..
2568           .156        .139        .179        .017       ..
2578           .214        .155        .167        .059       ..
2678           .207        .168        .158        .039       ..
3456           .305        .012        .034        .293       ..
3457           .284        .010        .003        .274       ..
3458           .198        .012        .009        .186       ..
3467           .268        .057        .006        .211       ..
3468           .215        .065        .017        .150       ..
3478           .294        .006        .014        .288       ..
3567           .027        .152        .015       −.125       ..
3568           .021        .106        .041       −.085       ..
3578           .029        .010        .038        .019       ..
3678           .157        .011        .036        .146       ..
4567           .094        .194        .178       −.100       ..
4568           .075        .135        .204       −.060       ..
4578           .103        .116        .190       −.013       ..
4678           .200        .126        .180        .074       ..
5678           .496        .338        .295        .158       ..

The entries of the t_abdc column below 2368, the third tetrad column, and the
column of type symbols are not legible in the source.

* Indicates that all three tetrads are zero.
† Indicates two tetrads equal but not zero.
‡ Indicates no tetrads equal.
weights and variables may be better, but it seems needlessly complex
for the present purpose. 

If only the a and s factors of equations (1) were involved, Professor 
Spearman’s theory would require all two hundred ten tetrad differ- 
ences to be zero (within the limits of sampling error to be discussed 
later). The presence of the other general factors b, c, d, and e makes 
the vanishing of all tetrad differences improbable. In fact, Professor 
Spearman would be the last to expect a set of variables such as 1, 2, 5, 
and 6 to have only a general and four specific factors. It is idle to 
expect tetrad differences to vanish if such overlap exists. If equations 
(1) are approximately correct, however, variables 1, 4, 5, and 7 should 
be expressible as due to only one general and four specific factors.
Considering all eight variables, therefore, we shall expect certain tetrad 
differences to vanish, and certain others not to be zero. The types of 
tetrads that arise may be seen by expressing the correlations in general 
form, following the notation of Professor Kelley. From equations (1) 

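$$
\begin{aligned}
r_{12} &= \alpha_1\alpha_2 + \beta_1\beta_2 \\
r_{13} &= \alpha_1\alpha_3 + \delta_1\delta_3 \\
r_{34} &= \alpha_3\alpha_4 + \gamma_3\gamma_4 \\
r_{56} &= \alpha_5\alpha_6 + \epsilon_5\epsilon_6
\end{aligned}
$$

All other correlations $r_{ij}$ will be of the form $\alpha_i\alpha_j$, where
$\alpha_i = c_i\sigma_a/\sigma_i$, $\beta_i = k_i\sigma_b/\sigma_i$, etc.

Let us reconsider the variables 1, 2, 5, and 6 for illustration.

$$
\begin{aligned}
t_{1256} &= r_{12}r_{56} - r_{15}r_{26} = (\alpha_1\alpha_2 + \beta_1\beta_2)(\alpha_5\alpha_6 + \epsilon_5\epsilon_6) - \alpha_1\alpha_5\alpha_2\alpha_6, \\
t_{1265} &= r_{12}r_{56} - r_{16}r_{25} = (\alpha_1\alpha_2 + \beta_1\beta_2)(\alpha_5\alpha_6 + \epsilon_5\epsilon_6) - \alpha_1\alpha_6\alpha_2\alpha_5, \\
t_{1562} &= r_{15}r_{26} - r_{16}r_{25} = \alpha_1\alpha_5\alpha_2\alpha_6 - \alpha_1\alpha_6\alpha_2\alpha_5 = 0.
\end{aligned}
$$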
We should therefore expect two of the tetrads to be equal, but not 
zero, and one to vanish. Reference to Table I shows that 


$t_{1256} = .396$, $t_{1265} = .382$, and $t_{1562} = -.014$.


Such a relation between the tetrads shows overlap (here b in $x_1$ and
$x_2$ and e in $x_5$ and $x_6$). The sets of four variables where this type of
tetrads should occur are indicated in Table I by a dagger in the last
column. These were determined from equations (1) by taking all
possible sets of four variables where the second general factor occurred
once or twice, e.g., 1245, 1246, 1247, 1248, 1256, etc.

Another type of tetrad relationship is illustrated by variables 1, 2, 
3, and 4. 


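$$
\begin{aligned}
t_{1234} &= (\alpha_1\alpha_2 + \beta_1\beta_2)(\alpha_3\alpha_4 + \gamma_3\gamma_4) - (\alpha_1\alpha_3 + \delta_1\delta_3)\alpha_2\alpha_4 \neq 0, \\
t_{1243} &= (\alpha_1\alpha_2 + \beta_1\beta_2)(\alpha_3\alpha_4 + \gamma_3\gamma_4) - \alpha_1\alpha_4\alpha_2\alpha_3 \neq 0, \\
t_{1342} &= (\alpha_1\alpha_3 + \delta_1\delta_3)\alpha_2\alpha_4 - \alpha_1\alpha_4\alpha_2\alpha_3 \neq 0.
\end{aligned}
$$

Here the tetrad differences are all unequal and none would be
expected to vanish unless quantities like $\delta_1$ or $\delta_3$ vanish. These
tetrads are indicated in Table I by the symbol ‡ in the last column.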

The final, and for the present purposes, the most interesting type 
of tetrad difference is that where all three of a triplet set vanish. 
These are indicated in Table I by a star of which there are twenty- 
three, so that the total number of tetrad differences which should 
vanish is sixty-nine. 

As an illustration let us take the four variables 1, 4, 5, and 7. 

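$$
\begin{aligned}
t_{1457} &= \alpha_1\alpha_4\alpha_5\alpha_7 - \alpha_1\alpha_5\alpha_4\alpha_7 = 0, \\
t_{1475} &= \alpha_1\alpha_4\alpha_5\alpha_7 - \alpha_1\alpha_7\alpha_4\alpha_5 = 0, \\
t_{1574} &= \alpha_1\alpha_5\alpha_4\alpha_7 - \alpha_1\alpha_7\alpha_4\alpha_5 = 0.
\end{aligned}
$$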

We have only to take all sets of four variables from equations (1) 
in which no common second factor occurs. By count there are twenty- 
two other sets of this type. 
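The count of twenty-three such sets can be reproduced by enumeration. The sketch below assumes that the only pairs sharing a second common factor in scheme (1) are variables 1 and 2 (factor b), 1 and 3 (factor d), 3 and 4 (factor c), and 5 and 6 (factor e), and it keeps every set of four variables that contains none of these pairs. The names `OVERLAPPING_PAIRS` and `has_overlap` are illustrative.

```python
from itertools import combinations

# Pairs of variables that share a second common factor under scheme (1):
# b in 1 and 2, d in 1 and 3, c in 3 and 4, e in 5 and 6 (assumed reading).
OVERLAPPING_PAIRS = [(1, 2), (1, 3), (3, 4), (5, 6)]


def has_overlap(quad):
    """True if the four variables include a pair with a shared second factor."""
    return any(i in quad and j in quad for i, j in OVERLAPPING_PAIRS)


all_quads = list(combinations(range(1, 9), 4))
starred = [q for q in all_quads if not has_overlap(q)]

print(len(all_quads), len(starred), 3 * len(starred))
for quad in starred:
    print("".join(str(v) for v in quad))
```

Each starred set contributes three tetrad differences, which gives the sixty-nine differences expected to vanish.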

Viewed in another way we may write 


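$$
\begin{aligned}
x_1 &= c_1 a + k_1 b + j_1 d + s_1 = c_1 a + l_1 t_1 \\
x_4 &= c_4 a + k_4 c + s_4 = c_4 a + l_4 t_4 \\
x_5 &= c_5 a + k_5 e + s_5 = c_5 a + l_5 t_5 \\
x_7 &= c_7 a + s_7 = c_7 a + l_7 t_7
\end{aligned}
$$


where the l’s are new constants and the t’s are new specific factors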
formed from the old ones. In such a system we would expect all three 
tetrads to vanish. 

Now the question arises: How well do the theoretical types of tetrad 
triplets determined from equations (1) and indicated in Table I agree 
with those found in the data? ‘To answer this question fully it is 
necessary to work out the probable errors of the individual tetrad 
differences by the formula given by Spearman and the writer, or by 
Professor Kelley on p. 49 of his book. In passing it may be stated that 
the formulas are the same, excepting that Professor Kelley has kept a 
term which we dropped as ordinarily negligible. The calculations here 
given are by Kelley’s fuller formula even though the additional term 
proved to be negligible. The labor of such calculations is very con- 
siderable and only three examples are given here for illustration. 

$t_{1458} = -.028 \pm .029$, $t_{4678} = .074 \pm .030$, and $t_{3678} = .146 \pm .030$

This last tetrad difference is the largest of the sixty-nine which we 
would expect to be zero. Being nearly five times its probable error it 
would generally be regarded as significantly different from zero, but 
being only one of sixty-nine we do not regard such a deviation with 
undue alarm. 
Another and simpler way to test the sixty-nine tetrad differences is
to arrange them in a frequency distribution and determine the probable 
error of the whole lot by the Spearman-Holzinger formula given by 
equation 16A in “Abilities of Man.” The standard error by this 
formula is .043 while the probable error is .029. 

In the table below the observed standard deviation of the sixty- 
nine tetrads is .044, the theoretical and observed agreeing almost to 
three decimal places. The mean is also very nearly zero. Professor 
Pearson has lately devised a formula for testing the difference between 
theoretical and observed sigmas, but since they agree in this case 
to within the limits of the accuracy of the data no further test 
is necessary. 


TETRAD DIFFERENCE    FREQUENCY
      .14¹               1
      .12                1
      .10               ..
      .08                1
      .06                2
      .04                6
      .02                9
      .00               14
     −.02               10
     −.04               14
     −.06                7
     −.08                3
     −.10                1

     Total              69


Mean = −.0098; SD = .044; PE = .030.
¹ The exact class limits were .1295 to .1495, etc., or class values .1395, .1195, etc.
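The observed mean and standard deviation of the sixty-nine tetrad differences can be recomputed from the intercorrelations. The sketch below is illustrative and rests on stated assumptions: the sign of r23 is taken as negative (inferred from the signed products in Table I), r27 is entered as .332 because its last digit is illegible, the overlapping pairs are those implied by scheme (1), and the population standard deviation is used. It works from the ungrouped differences, so small departures from the grouped figures are to be expected.

```python
from itertools import combinations
from statistics import mean, pstdev

# Intercorrelations of the eight tests. Assumptions: r23 is negative and
# r27 = .332 (fourth decimal illegible in the source).
R = {
    (1, 2): .6328, (1, 3): .2412, (1, 4): .0586, (1, 5): .1950,
    (1, 6): .2318, (1, 7): .2969, (1, 8): .1999,
    (2, 3): -.0553, (2, 4): .0655, (2, 5): .3097, (2, 6): .2988,
    (2, 7): .332, (2, 8): .2501,
    (3, 4): .4248, (3, 5): .0420, (3, 6): .2270, (3, 7): .0215, (3, 8): .0573,
    (4, 5): .1487, (4, 6): .2893, (4, 7): .2489, (4, 8): .2843,
    (5, 6): .7169, (5, 7): .6693, (5, 8): .4662,
    (6, 7): .6319, (6, 8): .5050, (7, 8): .6915,
}

# Pairs sharing a second common factor under scheme (1) (assumed reading).
OVERLAPPING_PAIRS = [(1, 2), (1, 3), (3, 4), (5, 6)]


def tetrads(a, b, c, d):
    """Return (t_abcd, t_abdc, t_acdb) for variables a < b < c < d."""
    p1 = R[(a, b)] * R[(c, d)]
    p2 = R[(a, c)] * R[(b, d)]
    p3 = R[(a, d)] * R[(b, c)]
    return (p1 - p2, p1 - p3, p2 - p3)


def has_overlap(quad):
    """True if the four variables include a pair with a shared second factor."""
    return any(i in quad and j in quad for i, j in OVERLAPPING_PAIRS)


starred = [q for q in combinations(range(1, 9), 4) if not has_overlap(q)]
values = [t for q in starred for t in tetrads(*q)]

print(len(values), round(mean(values), 4), round(pstdev(values), 3))
```

Under these assumptions the mean is slightly negative and the standard deviation falls near the .044 reported, with the largest difference arising from variables 3, 6, 7, and 8.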


The writer would conclude that the above sixty-nine tetrad differ- 
ences arising from twenty-three sets of four variables amply support
Professor Spearman’s Two Factor Theory. In short, for those vari- 
ables where it is legitimate to use the tetrad criterion it works. The 
remainder of the tetrads in Table I appear to agree very well with 
the expected form assuming equations (1). 

The use of the full probable error formula for each of the sixty-nine 
tetrad differences would be a more exact test, but enough have been 
computed to convince the writer that they are of the order .03 and 
that therefore about four-fifths of all the tetrad differences would not 
be more than twice their probable errors. 
Such evidence may not convince Professor Kelley for he quotes
four tetrads on p. 231 of his book with the following comment: “.009
± .007, −.015 ± .007, .025 ± .008, −.019 ± .008. Surely one
should conclude that these four mean tetrads are not chance deviations
from zero.” When one considers that these four tetrads were plucked
from a table of several hundred, such a comment appears to the writer 
not only incorrect but unfair to Professor Spearman. 

Thus far we have attempted to show that certain quadruplets of 
four variables may be resolved into two factors in accord with Profes- 
sor Spearman’s theory. This does not touch the question as to what 
the a, or as Spearman denotes it the g, factor really is.¹ Professor
Spearman has discussed this question in “Abilities of Man.” Professor
Kelley also discusses it to some extent in “Crossroads in the
Mind of Man.” Thus on p. 110 the latter says: “This analysis of the
a factor leads us to wonder whether, had we experimentally allowed
for maturity, race, sex, and general nurture, any a or general factor
would have remained. In other words, we may wonder if there is any
factor at all independent of these things corresponding to Spearman’s
idea of a ‘central fund of intellective energy’ or ‘general ability’ as
‘g.’” The analysis referred to above is not in the least convincing to
the writer as will be explained in a subsequent paper. In any event 
the question shouldn’t be left to speculation. 

Professor Spearman’s theory has not been overthrown, but sup- 
ported by Professor Kelley’s data. The multiple factor theory 
initiated by Maxwell Garnett and elaborated by Professor Kelley will 
undoubtedly help us to unravel some of the complex relationships 
existing between mental traits, but the writer at least sees no contra- 
diction between the evidence presented in Professor Kelley’s book and 
Professor Spearman’s theory. 





¹ It is interesting to note here the correlations of the eight tests with g.
These values, which are the average of several determinations, may be set down as
follows: $r_{1g} = .27$, $r_{2g} = .34$, $r_{3g} = .08$, $r_{4g} = .28$, $r_{5g} = .70$,
$r_{6g} = .94$, $r_{7g} = .91$, $r_{8g} = .82$. Professor Kelley’s memory tests appear
to be full of g.








NOTE ON PROFESSOR FREEMAN’S DISCUSSION OF 
THE STANFORD STUDY OF FOSTER CHILDREN 


BARBARA S. BURKS


Stanford University 


In the September 1928 number of this Journal, Professor Frank N. 
Freeman and others have published critical appraisals of the “Twenty-seventh
Yearbook of the National Society for the Study of Education
(Nature-Nurture).” In Professor Freeman’s discussion of the writer’s
study of foster children, an inadvertent error of fact and another of 
interpretation have crept in which alter the purport of the study as 
originally published. 

1. Upon the basis of the occupational status of a number of the 
true parents of the group of foster children studied by Stanford, I have 
estimated the hereditary ability of the group to be average or very 
slightly above. That estimates made upon such a basis are valid on 
the average if applied to reasonable numbers of individuals is shown by 
the fact that the “control” fathers and mothers of the “control”
children studied in our investigation proved to be almost indistinguish- 
able from the foster fathers and mothers in means and dispersions of 
mental ability, although the control parents were selected by us to 
match the foster parents solely on the criterion of occupational status 
of the fathers. 

Professor Freeman writes: “Miss Burks makes the rather surprising
estimate that if the true parents of the foster group were tested, their 
IQ would be found to be slightly above the average. This estimate is 
based on rather meager data concerning the occupations of the true 
fathers and mothers. Concerning the mothers, over eighty per cent 
of whom were unmarried mothers, the estimate is particularly doubtful 
in view of the fact that the largest single group are housemaids.”’ 

Freeman has apparently misread the data upon the occupational 
status of the true mothers of the foster group. Twenty-two, or 
twenty-nine per cent, of the seventy-seven true mothers whose occupa- 
tions are on record were commercial employees such as clerks, book- 
keepers and stenographers; fifteen, or about twenty per cent, were 
listed as students in grammar school, high school, or college; two, or 
2.6 per cent, were teachers. There were sixteen housemaids in the 
group, and these comprised the “largest single group,” not of the total 
number of mothers, but of the particular class under which we listed 
them (i.e., a combination of thirty-eight individuals in skilled,
semi-skilled and unskilled occupations).

To secure further light on this point I have gone to our original data 
and tabulated the IQ’s of the foster children whose true mothers were 
housemaids. The average IQ of these children is 101.3. This is six
points less than the average IQ of the total group of foster children, 
and it is evident that the six points difference cannot be due to a 
difference in environment, inasmuch as the intelligence levels of the 
foster fathers and foster mothers of the foster children whose true 
mothers were housemaids average the same (within a tenth of a year 
MA) as those of the total group of foster fathers and foster mothers. 
We therefore attribute the six points difference to heredity. This 
evidence not only supports the claim that housemaids are not
representative of the “true mothers” as a group, but provides additional
data from which to estimate the effect of environment upon our
subjects. If the average IQ of housemaids may be estimated as about
ninety, and if the correlation between the intelligence of mothers and
offspring is about .45 to .50, the predicted IQ of the offspring of
housemaids should regress five points toward the mean, and is therefore
about ninety-five. Offspring of housemaids average one hundred one,
however, when reared in superior foster homes. The effect of
environment, therefore, is to raise their IQ’s about six points, a value which
agrees within a point or two with a similar estimate, for our entire foster
group, made in the original study.
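
The arithmetic of the preceding paragraph may be retraced in a short sketch. It is illustrative only: the population mean of 100 is an assumption (the text says simply "the mean"), and the function name is hypothetical.

```python
def predicted_offspring_iq(parent_iq, r, population_mean=100.0):
    """Regress a parent's IQ toward the population mean by the
    parent-offspring correlation r (population mean assumed to be 100)."""
    return population_mean + r * (parent_iq - population_mean)

housemaid_iq = 90.0          # estimate quoted in the text
observed_foster_iq = 101.3   # average IQ of the foster children of housemaids

for r in (0.45, 0.50):
    predicted = predicted_offspring_iq(housemaid_iq, r)
    gain = observed_foster_iq - predicted
    print(f"r = {r:.2f}: predicted IQ = {predicted:.1f}, "
          f"apparent environmental gain = {gain:.1f}")
```

With either value of the correlation the predicted IQ falls near ninety-five and the apparent gain near six points, which is the order of magnitude claimed in the text.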

The reader will surely agree that the housemaids were well balanced 
by young women in occupations usually requiring above average 
intelligence. There can be little doubt, also, that the true fathers of 
our foster children averaged up to or above the ability of unselected 
men, for their mean occupational rating exceeded that of unselected 
men in California cities by a significant amount. These considerations,
and the additional fact that infants suspected of feeble-mindedness, or 
whose true fathers or mothers were known to be mentally deficient 
were not placed in adoptive homes until old enough for a mental test 
(thus excluding them from our group of subjects, who were placed in 
homes when under twelve months of age), put the burden of proof
upon any one who believes the native ability of our foster group to be 
less than average. 

2. In interpreting the significance of data reported in the Stanford
study, Freeman mentions the fact that the mean IQ of our foster
group is one hundred seven, while that of our parallel control group of
“own” children in homes of the same grade is one hundred fifteen.
“I have estimated,” he writes, “that the homes were responsible for
raising the average of the foster group ten points or more. The
superior heredity of the own children was evidently responsible for an
additional superiority of eight points or more. This may perhaps
justify an estimate that the environmental difference under
consideration has as much or more effect than the parental difference in
heredity.”

It has been shown in the first section of the present note that the 
ten points attributed by Freeman to the influence of home environment 
is probably too high an estimate. A matter of three or four points in 
this estimate is of minor concern, however, as compared with a fallacy 
behind his estimate of the relative contributions of heredity and 
environment. 

The relative contributions of heredity and environment can not be
measured, as Freeman seems to imply, by their simple effects upon
group means. The statistical considerations involved are complex,
but we may state the case briefly by saying that the mean level of a
dependent variable (such as IQ in our group) becomes harder and
harder to raise as the proportional contribution of “causes” becomes
higher and higher. For example, if half of the elements of the
dependent variable were due to one cause, while only a fourth of the elements
of the dependent variable were due to another cause, we should be
justified in saying that one cause made only half as much contribution
as the other cause. But equal deviations of the two causes would
alter the mean of the dependent variable, not in the ratio of 1 to 2, but
in the ratio of about 5 to 7 (i.e., in the ratio of the square roots of 1/4
and 1/2). The contributions, however, of these causes to the variance
of the dependent variable would be in the ratio of 1 to 2.¹
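
The two ratios in this example can be checked numerically. The sketch below assumes, following the author's footnote, that a cause's share of the variance is the square of its correlation with the dependent variable, so that its effect on the mean goes with the square root of that share; the variable names are hypothetical.

```python
import math

share_first = 0.25    # a fourth of the elements due to one cause
share_second = 0.50   # half of the elements due to the other cause

# Contributions to the variance compare directly as the shares themselves.
variance_ratio = share_first / share_second

# Effects of equal deviations on the mean compare as the square roots.
mean_shift_ratio = math.sqrt(share_first) / math.sqrt(share_second)

print(f"ratio of contributions to variance: {variance_ratio:.3f}")
print(f"ratio of effects on the mean:       {mean_shift_ratio:.3f}")
print(f"5 to 7 for comparison:              {5 / 7:.3f}")
```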

Another reason why it is impossible to estimate the relative con- 
tributions of heredity and environment by manipulating group means 
is that the influence of heredity, in known modes of hereditary trans- 
mission, is always far stronger than parental correlations alone would 
indicate. This is necessarily the case because only a random half of 
the chromosomes of each parent are passed on to any single offspring. 
Hence, the amount of any complex trait possessed by a parent is
determined by a number of factors besides those which the parent happens
to contribute to the particular gamete that results in a particular child.
Heredity, therefore, makes to human differences a large contribution
that is completely overlooked if we merely compare the mean scores
of foster and “own” children being reared in similar homes by similar
parents.

1 These conclusions are based upon the fact that contributions to the variance
of the dependent variable are proportional to the squares of the correlation
coefficients.


The view held by many students of heredity, including the writer,
is that it is necessary to work with the dispersion in human ability 
rather than with mean levels of ability in order to interpret the relative 
potency of heredity and environment. Our Stanford data show that 
if the measurable home environment of our foster families were made 
constant, the variance in the IQ’s of our foster children would be
reduced by only seventeen per cent. Certainly it seems a far stretch 
of the imagination to attribute equal effects to heredity and environ- 
ment when environment, even in extreme cases, may stimulate or 
depress the IQ only by about twenty points, while heredity can produce
alike the idiot of twenty and the genius of two hundred IQ. 








ON THE DETERMINATION OF RELIABILITY IN 
COMPARING THE FINAL MEAN-SCORES 
OF MATCHED GROUPS 


E. F. LINDQUIST AND R. R. FOSTER 


State University of Iowa 


The purpose of this discussion is to point out a statistical fallacy 
commonly employed in the interpretation of a certain type of results in 
educational research. Most research workers in education are familiar 
with the technique usually used to determine the relative effects, 
upon groups of pupils, of two different methods of teaching a given 
unit of subject-matter. This procedure involves the selection of a 
sampling of pupils to constitute an “experimental” group, and
another sampling of pupils to constitute a “control” group. The two
samplings are usually so selected that at the beginning of the experi- 
ment they are evenly matched with respect to all controllable factors 
which might influence their progress, except the particular factor to be 
studied—namely, the method of instruction used. Obviously, one 
of the most important factors to be controlled is the initial ability of 
the pupils in the function under consideration. Therefore, care is 
taken to equate the means and standard deviations of the two groups 
on a valid test of this ability. Then, since both groups begin
presumably at the same point on the scale of ability, subsequent
conclusions regarding the relative effectiveness of the two methods of
instruction are based upon a comparison of the mean-scores on the 
same test or on equivalent tests given at the end of the experimental 
period. 

The statistical reliability of any difference obtained between the
final mean-scores of the two groups is usually computed according to
the well-known formula for the standard error of differences between
uncorrelated measures:

$$\sigma_{\text{diff.}} = \sqrt{\sigma_1^2 + \sigma_2^2} \qquad (1)$$

where $\sigma_1$ and $\sigma_2$ are the standard errors of the two final mean-scores
between which the difference is found. If the observed difference is
then found to be three or more times its standard error, it is considered
probably significant. There is, however, a fallacy in the use of
formula (1). This formula is quite defensible when used in a certain
type of problem, but its application in a situation like the one just
outlined is based on fallacious reasoning.

It should be obvious that the investigator in such a situation is
interested, not primarily in the difference between the final abilities 
of the two groups of pupils, but rather in the difference between the 
amounts of progress which the two groups have made since the initial 
tests were given. He should be concerned not with the final scores 
alone, but with the gains of the two groups as measured by the differ- 
ences between their respective mean-scores on the initial and the final 
tests. It is true, of course, that the difference between the final 
mean-scores obtained is the same as the difference between the gains 
in mean-score. But the standard error of the gain for each group is 
not the same as the standard error of the final mean-score for that 
group. And therefore it follows that the standard error of the differ- 
ence between the gains of the two groups is not the same as the stand- 
ard error of the difference between their final mean-scores (as computed 
by formula (1)). In the first place, the measure obtained by formula (1),

$$\sigma_{\text{diff.}} = \sqrt{\sigma_1^2 + \sigma_2^2}$$

takes no account of the error of sampling in the initial mean-score
observed for each group. In the second place, it fails to allow for the 
correlation which nearly always exists between successive measures 
of the same individuals or groups. 

Thus it is clear that a formula is needed which will include these 
two factors—a formula which will yield a defensible measure of the 
reliability of a difference between the observed gains of groups in 
mean test-scores. The formula here developed for that purpose 
involves nothing new in statistical theory. It merely applies principles
which are already recognized in a manner pertinent to the particular
problem at hand.


Let

$X_1$ = observed initial mean-score of an “experimental” group of pupils,
$X_2$ = observed final mean-score of this “experimental” group,
$X_3$ = observed initial mean-score of a “control” group of pupils,
$X_4$ = observed final mean-score of this “control” group;

$M_1$ = true mean-score from which $X_1$ deviates by $x_1$,
$M_2$ = true mean-score from which $X_2$ deviates by $x_2$,
$M_3$ = true mean-score from which $X_3$ deviates by $x_3$,
$M_4$ = true mean-score from which $X_4$ deviates by $x_4$;

also, let

$\sigma_1$ = standard error of mean-score $X_1$,
$\sigma_2$ = standard error of mean-score $X_2$,
$\sigma_3$ = standard error of mean-score $X_3$,
$\sigma_4$ = standard error of mean-score $X_4$;

and finally, let

$g_e$ = observed gain in mean-score of the “experimental” group,
$g_c$ = observed gain in mean-score of the “control” group,
$D_g$ = observed difference between gains of the two groups.


Then,

$$g_e = X_2 - X_1$$

or,

$$g_e = (x_2 + M_2) - (x_1 + M_1)$$

or,

$$g_e = (x_2 - x_1) + (M_2 - M_1).$$

Similarly, omitting the intermediate steps,

$$g_c = (x_4 - x_3) + (M_4 - M_3).$$

And since

$$D_g = g_e - g_c,$$

we may write

$$D_g = (x_2 - x_1) + (M_2 - M_1) - (x_4 - x_3) - (M_4 - M_3)$$

or,

$$D_g = (x_2 - x_1 - x_4 + x_3) + (M_2 - M_1 - M_4 + M_3).$$

Then, transposing,

$$D_g - (M_2 - M_1 - M_4 + M_3) = (x_2 - x_1 - x_4 + x_3).$$

But $D_g - (M_2 - M_1 - M_4 + M_3)$ is the deviation of the obtained
difference between gains from the true difference.


Therefore, supposing that we have a large number of observed
differences between gains, we may express the square of their standard
error thus:

$$\frac{1}{N}\sum \{D_g - (M_2 - M_1 - M_4 + M_3)\}^2 = \frac{1}{N}\sum (x_2 - x_1 - x_4 + x_3)^2$$

or,

$$\sigma_{D_g}^2 = \frac{1}{N}\sum (x_2 - x_1 - x_4 + x_3)^2.$$

Then, squaring in the right-hand member we have

$$\sigma_{D_g}^2 = \frac{1}{N}\sum (x_1^2 + x_2^2 + x_3^2 + x_4^2 - 2x_1x_2 - 2x_3x_4 - 2x_1x_3 - 2x_2x_4 + 2x_2x_3 + 2x_1x_4).$$


But since there can be no real correlation between the scores of
one group and those of another,

$$\sum x_1x_3,\ \sum x_1x_4,\ \sum x_2x_3,\ \text{and}\ \sum x_2x_4\ \text{each} = 0,$$

and the last four terms may be omitted from the equation, thus:

$$\sigma_{D_g}^2 = \frac{1}{N}\sum (x_1^2 + x_2^2 + x_3^2 + x_4^2 - 2x_1x_2 - 2x_3x_4).$$


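Multiplying through by $\frac{1}{N}$ we may write

$$\sigma_{D_g}^2 = \frac{\sum x_1^2}{N} + \frac{\sum x_2^2}{N} + \frac{\sum x_3^2}{N} + \frac{\sum x_4^2}{N} - \frac{2\sum x_1x_2}{N} - \frac{2\sum x_3x_4}{N}.$$

But

$$\frac{\sum x_1^2}{N} = \sigma_1^2;\quad \frac{\sum x_2^2}{N} = \sigma_2^2;\quad \frac{\sum x_3^2}{N} = \sigma_3^2;\quad \text{and}\quad \frac{\sum x_4^2}{N} = \sigma_4^2.$$

Also,

$$\frac{2\sum x_1x_2}{N} = \frac{2(\sum x_1x_2)}{N\sigma_1\sigma_2}\,\sigma_1\sigma_2 = 2r_{12}\sigma_1\sigma_2;\qquad \frac{2\sum x_3x_4}{N} = \frac{2(\sum x_3x_4)}{N\sigma_3\sigma_4}\,\sigma_3\sigma_4 = 2r_{34}\sigma_3\sigma_4.$$
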
Therefore, we may write

$$\sigma_{D_g}^2 = \sigma_1^2 + \sigma_2^2 + \sigma_3^2 + \sigma_4^2 - 2r_{12}\sigma_1\sigma_2 - 2r_{34}\sigma_3\sigma_4,$$

where $r_{12}$ is the coefficient of correlation between initial and final
mean-scores of a large number of such “experimental” groups, and
$r_{34}$ is the coefficient of correlation between initial and final
mean-scores of a large number of such “control” groups.

Then

$$\sigma_{D_g} = \sqrt{\sigma_1^2 + \sigma_2^2 + \sigma_3^2 + \sigma_4^2 - 2r_{12}\sigma_1\sigma_2 - 2r_{34}\sigma_3\sigma_4} \qquad (2)$$


As noted above, the r’s in this formula represent correlations 
between obtained mean-scores. But Kelley has shown¹ that such a
correlation is equal to the correlation between the two series of indi- 
vidual scores. Therefore, the values to be substituted for the r’s in 
formula (2) are the coefficients of correlation between initial and final 
scores of the individual pupils in each group. 
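
As a rough numerical check on the derivation, formula (2) can be compared with the variance of $x_2 - x_1 - x_4 + x_3$ computed directly from a table of covariances in which the cross-group terms are zero. The standard errors and correlations below are invented for illustration and do not come from any experiment; the function names are hypothetical.

```python
import math

def se_gain_difference(s1, s2, s3, s4, r12, r34):
    """Formula (2): standard error of the difference between two gains."""
    return math.sqrt(s1**2 + s2**2 + s3**2 + s4**2
                     - 2 * r12 * s1 * s2 - 2 * r34 * s3 * s4)

def se_from_covariances(s, r12, r34):
    """Square root of the variance of x2 - x1 - x4 + x3, taking every
    covariance between the two groups as zero."""
    cov = [[0.0] * 4 for _ in range(4)]
    for i in range(4):
        cov[i][i] = s[i] ** 2
    cov[0][1] = cov[1][0] = r12 * s[0] * s[1]   # initial vs. final, experimental
    cov[2][3] = cov[3][2] = r34 * s[2] * s[3]   # initial vs. final, control
    weights = [-1.0, 1.0, 1.0, -1.0]            # coefficients of x1, x2, x3, x4
    variance = sum(weights[i] * cov[i][j] * weights[j]
                   for i in range(4) for j in range(4))
    return math.sqrt(variance)

sigmas = (0.9, 1.0, 0.9, 1.1)   # hypothetical standard errors of X1..X4
r12, r34 = 0.80, 0.75           # hypothetical initial-final correlations

direct = se_from_covariances(sigmas, r12, r34)
by_formula = se_gain_difference(*sigmas, r12, r34)
print(f"direct: {direct:.4f}   formula (2): {by_formula:.4f}")
```

The two figures agree because the four cross-group product terms drop out, exactly as in the step where they are set equal to zero.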

Formula (2), then, is the one which should be used to determine
the significance of an observed difference between the gains of two
groups on the same test or on equivalent forms of the same test. If
the initial and final scores are highly correlated, the standard error of
the difference between the gains will be much smaller than the standard
error of the difference between the final mean-scores (as computed by
formula (1)). If, on the other hand, there is a very low correlation
between the initial and final measures, the standard error of the
difference between the gains will be much greater than the standard
error of the difference between final mean-scores. It may be shown
that, in general, no serious error will result from the use of formula
(1), if the coefficients of correlation closely approximate .50, and if
the standard deviations of the initial and final scores are
approximately equal for each group. However, if the coefficients are greater
or smaller than this critical value, an error does result, its magnitude
depending upon how much greater or less than .50 the coefficients are.

1 Kelley, Truman L.: “Statistical Method.” P. 178, formula 118.
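
The statement about the critical value of .50 can be illustrated under the simplifying assumption, made here only for the sketch, that all four standard errors are equal and that both correlations are the same. The function names are hypothetical; in the notation of formula (2) the two final mean-scores are $X_2$ and $X_4$.

```python
import math

def se_final_means(se_final_exp, se_final_ctl):
    """Formula (1): standard error of the difference between final mean-scores."""
    return math.sqrt(se_final_exp**2 + se_final_ctl**2)

def se_gain_difference(s1, s2, s3, s4, r12, r34):
    """Formula (2): standard error of the difference between two gains."""
    return math.sqrt(s1**2 + s2**2 + s3**2 + s4**2
                     - 2 * r12 * s1 * s2 - 2 * r34 * s3 * s4)

def ratio_of_formulas(r, s=1.0):
    """Formula (2) divided by formula (1) when every standard error equals s
    and both correlations equal r (a simplifying assumption)."""
    return se_gain_difference(s, s, s, s, r, r) / se_final_means(s, s)

for r in (0.0, 0.5, 0.9):
    print(f"r = {r:.1f}: formula (2) / formula (1) = {ratio_of_formulas(r):.3f}")
```

Under these assumptions the ratio reduces to the square root of 2(1 - r), which is unity at r = .50, greater than unity for lower correlations and less than unity for higher ones.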

It should be noted that formula (2) can be used only when the 
initial and final abilities of groups are measured by means of the same 
test or equivalent forms of the same test. If the final test is not 
equivalent to the one used in the initial matching of the groups, the 
final comparison must be limited to the final mean-scores, and formula 
(1) must be used to determine the reliability of the results. Even in 
this type of situation, however, the use of formula (1) is not strictly 
defensible, since there are always errors of sampling in the initial 
mean-scores, and since the probability of correlation between the 
initial and final measures is great. Both of these factors, as we have
seen, are left entirely out of account in formula (1). It appears, there- 
fore, that the investigator of the relative effectiveness of various 
instructional methods should, if possible, arrange his testing program 
so that formula (2) will be applicable to the results he secures. 





THE EVALUATION OF SELF-ADMINISTERING 
SPELLING TESTS 


RUDOLF PINTNER, HENRY D. RINSLAND AND JOSEPH ZUBIN 
Teachers College, Columbia University 


A test to measure spelling that is not administered by the usual 
dictation method, but is “self-administering” seems to be desirable.
Such a form has been used by Otis¹ in his “Classification Test” in the
following form: 


Sample.—Which one of the five words is wrongly spelled? 
1. the 2. when 3. wil 4. same 5. and (3) 


Gates² has also used a test which he calls “spelling recognition” in
which the correct spelling is recognized and marked in a group con- 
taining four misspellings of the same word. The following is the 
form used: 

mine nin nyme nine nein 


It is the purpose of this article to examine the validity and relia- 
bility of tests of this nature. The two forms used which seem to give 
most promise are: First, the form used by Otis, sample of which is 
quoted above and which in this study will be known as Alpha; and 
second a form in which the correct spelling of a word with four mis- 
spellings of the word is given at the end of a completion sentence, 
requiring this word for the correct completion, to be known as Beta. 
The form is: 


Sample.—The boy played—————— in the yard. 
(1. bol 2. bal 3. boll 4. ball 5. baul) (4) 


The latter form recognizes the fact that many people know a word
as correctly spelled when they see it written instead of hearing it, and
that they often write a word to see if it “looks all right.” Just how
common this is, is not known, but it is widely used, and has been
completely neglected in spelling research. It is the problem of this study,
then, to investigate the relative value of the Alpha and Beta forms
given above. The criterion selected is the Morrison-McCall Spelling
Scale.³ Kelley⁴ gives this test a “median rating” of one by five
judges. Considerable statistical evidence also indicates the high
general merit of this scale.

1 Otis, A. S.: “Classification Test.” World Book Company, 1923, p. 1.

2 Gates, A. I.: Methods in Spelling. Journal of Educational Psychology, 1926,
Vol. XVII, p. 293.

3 McCall, W. A. and Morrison, J. C.: “Morrison-McCall Spelling Scale.”
World Book Company, 1923, List 1, p. 8.

4 Kelley, T. L.: “Interpretation of Educational Measurements.” World
Book Company, 1927, p. 245.

For this purpose the Alpha and Beta forms were made up using 
exactly the same words in list 1 of the criterion (there are fifty words in 
each list of the Morrison-McCall Scale). In the Beta form the sen- 
tences used were the same sentences as used in the Morrison-McCall 
Scale. 

The Alpha and Morrison-McCall were given to eighty pupils in 
Grades VIA and VIB, and the Beta and Morrison-McCall were given 
to sixty-eight pupils in Grades VIA and VIB. In each case the Alpha 
and Beta forms were given first. 

The mean, standard deviation and standard deviation of the mean 
of all the tests given are shown in Table I. 


TABLE I. THE MEAN, STANDARD DEVIATION AND STANDARD DEVIATION OF MEAN
OF ALL TESTS

                     | Morrison-McCall | Alpha | Morrison-McCall | Beta
                     |        1        |   2   |        3        |   4
Mean                 |      35.2       | 36.2  |      35.8       | 38.8
Standard deviation   |       6.3       |  6.1  |       5.8       |  5.4
SD of mean           |       .70       |  .68  |       .68       |  .65











The mean of the Morrison-McCall is in every case lower than that
of the experimental forms, while the standard deviation remains
essentially the same. Evidently the spelling of a word is harder than its
recognition in the two forms of the experimental tests.
Applying the test for the significance of the difference between the
means of the four tests, we get the following differences and reliabilities
as shown in Table II.


TABLE II. DIFFERENCES OF MEANS AND RELIABILITY

                      | Morrison-McCall | Morrison-McCall | Morrison-McCall | Alpha and
                      | and Alpha       | and Beta        | 1 and 2         | Beta
Difference of mean    |      1.0        |      3.0        |       .6        |   2.6
SD of difference      |      .95        |      .94        |      .95        |   .94
D/SD of difference    |     1.05        |      3.1        |      .63        |  2.60











The difference between the mean of the Morrison-McCall and the 
mean of Beta is evidently not due to chance fluctuations for it is 3.1 
times the sigma of the difference. The difference between the means
of the Morrison-McCall and the Alpha may be due to chance fluctua- 
tions, being only 1.05 times the sigma of the difference. The two 
groups tested do not differ significantly in their means on the Morrison- 
McCall, the difference being only .63 of the sigma of their differences. 
The two groups are equivalent so far as the criterion is concerned. 
But the two groups do differ significantly in the means of the Alpha 
and Beta Scores. This indicates that the spelling ability as measured 
by the Morrison-McCall Scale is only partially involved in these two 
tests. The validity and reliability of the Alpha and Beta forms are 
expressed in terms of the following correlations shown in Table III. 


TABLE III. CORRELATIONS SHOWING VALIDITY AND RELIABILITY OF ALPHA AND
BETA FORMS

Forms | Self-correlation | Correlation with criterion | Correlation with criterion corrected for attenuation¹
Alpha |    .84 ± .022    |         .69 ± .039         |         .78 ± .029
Beta  |    .84 ± .024    |         .74 ± .037         |         .84 ± .024

1 The coefficient of reliability for the Morrison-McCall is given by Kelley:
Op. cit., p. 315, as .93.
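
The corrected values in Table III appear consistent with the usual correction for attenuation, in which the observed correlation is divided by the square root of the product of the two reliabilities. The article does not print that formula, so the sketch below is an assumption about how the values were obtained; it also takes the self-correlation of both forms as .84, the figure the text gives for them, and the function name is hypothetical.

```python
import math

def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Observed correlation divided by the geometric mean of the two
    reliabilities (assumed here to be the correction the authors applied)."""
    return r_xy / math.sqrt(r_xx * r_yy)

form_reliability = 0.84                              # self-correlation of Alpha and Beta
correlation_with_criterion = {"Alpha": 0.69, "Beta": 0.74}

# .93 is the criterion reliability quoted from Kelley; .86 is the
# alternative value from the Ruch letter mentioned in the text.
for criterion_reliability in (0.93, 0.86):
    for form, r in correlation_with_criterion.items():
        corrected = correct_for_attenuation(r, form_reliability, criterion_reliability)
        print(f"criterion reliability {criterion_reliability:.2f}, "
              f"{form}: corrected r = {corrected:.2f}")
```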





The amount of error present in an individual score is shown by the 
probable error of a raw score and the probable error of an estimated true 
score in Table IV. 


TABLE IV. RELIABILITY OF ALPHA AND BETA FORMS IN TERMS OF THE PROBABLE
ERROR OF A TEST SCORE

Forms | PE (raw score) | PE (estimated true score)
Alpha |      1.62      |          1.46
Beta  |      1.44      |          1.29





The correlations with the criterion as shown in Table III for such a
narrow range of abilities (those represented in one year, Grades VIA
and VIB) are rather higher than the usual correlations for group tests.
Kelley¹ reports from a letter by G. M. Ruch from data submitted by
C. L. Cushman, that the self-correlation of the Morrison-McCall
Scale in grade VI is .86, N = 55. The Alpha and Beta forms are almost
as high, being .84.² The validity and reliability from this criterion
seem to be satisfactory.

1 Kelley: Op. cit., p. 315.

2 If this reliability of .86 is used instead of .93, the correlation with the criterion,
corrected for attenuation, will be for Alpha .81 and for Beta .87 instead of .78 and .84
respectively, as given in Table III.

Another question arises in evaluating such tests against any
criterion. If the criterion is considered a true test of spelling in what
degree do the Alpha and Beta forms measure the same words as the
criterion? This would answer an important question relative to the
diagnostic values of such forms, length being considered equal with
the criterion. In other words, if it is considered that the criterion is a
true test, within the limits of the words in the list, how well do the
Alpha and Beta forms measure the same words as the criterion, or what
are the chances that a given word spelled correctly in the criterion will
be spelled correctly in Alpha or Beta? For the purpose of measuring
the degree of association in which Alpha and Beta measure the same
words as the Morrison-McCall, four-fold association tables were
computed. These are shown in Tables V and VI.


TABLE V. ASSOCIATION OF WORDS IN ALPHA WITH MORRISON-McCALL

                                            | Number of words | Number of words | Total
                                            | correct Alpha   | incorrect Alpha |
Number of words correct Morrison-McCall     |      2742       |       468       | 3210
Number of words incorrect Morrison-McCall   |       384       |       406       |  790
Total                                       |      3126       |       874       | 4000


TABLE VI. ASSOCIATION OF WORDS IN BETA WITH MORRISON-McCALL

                                            | Number of words | Number of words | Total
                                            | correct Beta    | incorrect Beta  |
Number of words correct Morrison-McCall     |      2193       |       243       | 2436
Number of words incorrect Morrison-McCall   |       438       |       526       |  964
Total                                       |      2631       |       769       | 3400

In Table V the Q¹ or coefficient of association is +.72 and in Table
VI the Q is +.83. This means a high positive association between the
ability to spell a word correctly as measured by the criterion and the 
ability to recognize the same word in the Alpha and Beta forms. We 
have in this experiment a technique for comparing the ability to 
recognize a word in its correct and incorrect form with the ability to write the
word correctly from dictation. It is significant as well as quite logical 
that forty-eight per cent of the words missed on the Morrison-McCall 
were marked correctly in Alpha; and forty-six per cent of the words 
missed on the Morrison-McCall were marked correctly on the Beta. 
Common sense would tell us that there are quite a number of words on 
the periphery of our spelling ability whose correct spelling we recognize, 
but which we cannot spell correctly. It is somewhat more difficult to 
explain why pupils could not recognize the correct form of ten per cent 
of the words in Beta that they know how to spell, and to recognize as 
incorrect seventeen per cent of the words in Alpha that they know how 
to spell. It is likely that laxity in obeying directions, carelessness, and
the suggestive misspellings used were responsible for a good share of
these.
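
The coefficients quoted above can be reproduced from the four-fold tables with Yule's Q, (ad - bc)/(ad + bc). The sketch assumes that this is the formula applied (the article cites Yule but does not print it); the function name is hypothetical.

```python
def yules_q(a, b, c, d):
    """Yule's coefficient of association for the four-fold table
        a  b
        c  d
    where a and d are the agreeing cells."""
    return (a * d - b * c) / (a * d + b * c)

# Cell counts from Table V (Alpha) and Table VI (Beta):
# rows = correct / incorrect on the Morrison-McCall,
# columns = correct / incorrect on the experimental form.
q_alpha = yules_q(2742, 468, 384, 406)
q_beta = yules_q(2193, 243, 438, 526)

print(f"Q for Alpha: {q_alpha:+.2f}")
print(f"Q for Beta:  {q_beta:+.2f}")
```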


CONCLUSIONS 


1. The correlations between the two experimental forms and the 
criterion are sufficiently high to claim for them validity and reliability 
as group spelling tests. 

2. Form Beta is slightly superior to form Alpha both in correlation 
with the criterion and in the size of the probable error of an individual 
score. 

3. Form Beta yields a considerably higher coefficient of association 
with the criterion. 

4. It is believed that this is a reliable test of that part of spelling 
ability or word knowledge which involves recognition of the correct 
spelling of a word. It measures an ability which almost all individuals 
utilize. It offers a satisfactory technique of measuring spelling by a
“self-administering” or non-dictation method.



1 Yule, G. U.: “An Introduction to the Theory of Statistics.” Griffin, London,
1924, p. 38.


AN OBJECTIVE MEASURE OF ABILITY TO MAKE
TOPICAL OUTLINES¹


HELEN FERN DARINGER 


The Lincoln School of Teachers College 


PURPOSE OF THE SCALE 


The following notes describe the construction of a test designed to 
give an objective measure of ability to outline expository paragraphs. 

A general impression among teachers that pupils should be trained 
not only to select the central idea and supporting details in paragraphs 
of content material, but to organize these elements in the proper 
relationships, justly weighted, is strengthened by the judgments of 
such men as Thorndike and Gray. “Understanding a paragraph is
like solving a problem in mathematics. It consists of selecting the 
right elements of the situation and putting them together again in the 
right relations and also with the proper amount of weight and force for 
each. The mind is assailed as it were by every word in the paragraph. 
It must select, repress, soften, emphasize, correlate, and organize, all 
under the influence of the right mental set or purpose or demand.”²

Gray suggests the topical outline as one of the most effective means 
to study, and lays great stress on the necessity of training pupils “to
pick out the central thought of a selection and to organize the selection
in terms of main points and supporting details,” with clear appreciation
of the relative importance of the various parts.³

1 Thorndike, E. L.: Reading as Reasoning, a Study of Mistakes in Paragraph
Reading. Journal of Educational Psychology, June, 1917.

2 For the suggestion of the problem and for help and criticism without which
the study would have been impossible, the writer is indebted to Professor T.
H. Briggs and Professor Percival M. Symonds of Teachers College, Columbia
University.

3 Gray, W. S.: “Relation of Silent Reading to Economy in Education.”
National Society for Study of Education Yearbook, Vol. XVI, Part I.

The experience of many teachers is that two very helpful, if not
necessary, types of study for pupils are, respectively, the preparation
of a summary which contains the central ideas of the assignments, and
the preparation of an outline giving principal points and supporting
details arranged to show order of relative importance and relationships
to each other. Butterweck found that “practice (drill) in organizing
gives greater gain in comprehending reading than results from drill in
vocabulary or in retention.”¹


THE CRITERION 


The criterion used in validating the test was a group of topical 
outlines of expository paragraphs, scored separately by two teachers of 
high school English. The outlines were scored on a scale of twenty, 
with step-intervals of one. The score given to each outline represented 
the teacher’s judgment of the clarity, accuracy, and consistency with 
which the topic or topic sentence of the paragraph and the important 
supporting details (sub-topics) had been selected and related to each 
other. There was a correlation of .879, with a probable error of .024, 
between the scorings of the two teachers. The means of the two scor- 
ings were taken as the criterion. 

The outlines were made by forty pupils of Grade IX of the Lincoln
School of Teachers College. Each pupil made an outline for each of 
three paragraphs of varied difficulty. 


PRELIMINARY TESTS 


After a trial of a series of seven paragraphs with a number of high
school students in order to discover the forms which responses might be 
expected to take, eight sets, containing in all a series of twenty-six 
paragraphs, matched in what were judged to be equivalent pairs of 
paragraphs, of varying length and difficulty, were given to each of 
one hundred ten students (an eighth grade, two classes in freshman 
college English, one class in advanced college English), with instruc- 
tions to state the topic of each paragraph, and under it, in tabular 
form, to set down the most important sub-topics. The tests covered 
a period of about three weeks. 

The resulting topical outlines were scored, subjectively, and the 
types of errors and correct responses were noted. The errors with one 
exception were of the kinds expected: Failure to select the central idea; 
failure to discriminate between the central idea and subordinate ideas; 
failure to select one or more of the important supporting details (sub- 
topics); repetition of the same details in different words; confusion of 
unimportant with important details; and the selection as if by chance 
of phrases which, isolated, had no logical relationship to the other
elements of the outline. One expected error, the mis-ordering of
subordinate topics (for example, the reversal of the second and third),
appeared only three or four times in the three thousand eighty outlines,
and was therefore regarded as negligible when the method of objective
scoring was devised for the succeeding series of tests.

1 Butterweck, J. S.: "Problem of Teaching High School Pupils How to Study."
Teachers College Contributions to Education.


THE INVESTIGATION 


Of the thirteen pairs of paragraphs in these first tests, those which 
proved most nearly matched in means of scores were kept. Other 
pairs which in preparation had seemed equivalent showed in use such 
divergences as made it necessary to alter some and to discard others. 
New pairs were added to make a total of twenty paragraphs, arranged 
in four sets of five each, ranging from the very simple to the complex 
type, in order that nobody taking the tests would fail to make some 
score, and on the other hand that nobody could make a perfect score. 

In order to facilitate scoring, all except two of the paragraphs con- 
tained an identical number of sub-topics of the first rank, the ease with 
which the paragraph might be outlined depending not on the number of 
sub-topics, but on the inherent difficulty of subject-matter, the length 
and complexity of the paragraph, the degree of complication of unim- 
portant with important details, the amount of repetition and elabora- 
tion of ideas, and the degree of obviousness of the general topic of the 
paragraph, which might be more or less plainly stated or only implied. 

The scoring of the topical outlines had indicated three factors of 
which any method of objective scoring must take account: 


I. The selection of the controlling idea of the paragraph 
II. The selection of the important supporting details 


III. The exclusion of the unimportant, the elaborating, and the merely repetitious 
details 


I. Every well-unified paragraph presents not only details but a 
whole; it directs all aspects chosen for representation toward one 
interpretation, one idea. The one central idea should be reducible to 
a single sentence. This controlling or summarizing sentence may be 
stated at the beginning or near the beginning of the paragraph, in the 
middle, or at the end; it may be necessary to combine parts of sentences 
to frame a topic sentence; or it may be that the topic is not stated at all, 
but must be inferred by the reader. These various types of presenta- 
tion of the central idea were represented in the twenty paragraphs of 
the series, from the simplest and most obvious, which gave a short
clear summarizing sentence at the beginning and again at the end, to
the most difficult, which implied a centralizing idea. 

For the scoring of this first element of the test (the selection of the 
topic of the paragraph) a multiple response form was devised, with five 
possible choices in order to minimize the element of chance. 

II. Likewise, for the selection of the important supporting details,
a multiple response form with five possible choices for each of the
sub-topics was constructed.

III. The test may at first sight seem unduly simplified, but perhaps 
it can be shown that any over-simplification is apparent rather than 
real. The making of a complex topical outline requires, of course, the 
choice not only of coordinate details of the first importance but of less 
importance as well. The very act of isolating the important, however, 
is dependent on the ability to repress the unimportant, to select for 
emphasis from all the words assailing the mind those which contribute 
most forcefully and directly to the purpose, and to exclude others. 

Moreover, it seems fair to assume that the ability to select sub-
topics of the second degree and relate them to one of the first degree is
similar to, if not the same as, that required to select sub-topics
of the first degree and relate them to a central topic; and that the meas-
ure of the latter skill is a fair index of the former.

In addition, the less complex test has two advantages. It can be 
easily administered and quickly scored. Again, very little of the 
content material which high school pupils study is so involved or intri- 
cate as to demand minute analysis into component parts. 

The method used to measure ability to determine the exact number
of subdivisions in a paragraph, adding none, repeating none, omitting
none, was simple. On a slip of paper provided for the purpose, pupils
were directed to write down the number of important sub-topics
found in each paragraph.

The slips were collected to guard against subsequent alterations. 
The class was then told the number of sub-topics to be selected in each 
paragraph, in order that all pupils might have an equal opportunity to 
choose the correct ones.


SCORING THE TESTS


As has been stated, the preliminary tests revealed three elements 
upon which the rating of the outline depends: The selection of the 
central topic, the selection of the important details, and the exclusion 
of the repetitious and the unimportant. Even after a form had been
secured which admitted of the scoring of these three elements, the
problem remained as to the weight to be attached to each in the com- 
posite score. Different values were arbitrarily assigned to the correct 
responses in each of the three elements of the test until a combination 
was hit upon which made a satisfactory correlation with the criterion. 
These values are as follows: For correct topic sentence, three; for cor- 
rect number of sub-topics, one; for each correct sub-topic, one. Since 
all except two of the paragraphs contained four sub-topics, the maxi- 
mum possible score for any paragraph was eight. The two paragraphs 
which contained only three sub-topics had a maximum possible score 
of seven. 
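The weighting just described can be set out as a small sketch. The weights are those given in the text; the function names are hypothetical, and the make-up of a six-paragraph form (five paragraphs of four sub-topics and one of three) is an assumption chosen only to illustrate the arithmetic.

```python
TOPIC_WEIGHT = 3      # correct topic sentence
COUNT_WEIGHT = 1      # correct number of sub-topics
SUBTOPIC_WEIGHT = 1   # each correct sub-topic


def paragraph_score(topic_correct, count_correct, subtopics_correct):
    """Composite score for one paragraph from the three scored elements."""
    return (TOPIC_WEIGHT * int(topic_correct)
            + COUNT_WEIGHT * int(count_correct)
            + SUBTOPIC_WEIGHT * subtopics_correct)


def max_score(n_subtopics):
    """Highest possible score for a paragraph with n_subtopics sub-topics."""
    return paragraph_score(True, True, n_subtopics)


print(max_score(4))  # 8
print(max_score(3))  # 7

# Hypothetical form: five paragraphs of four sub-topics, one of three.
print(sum(max_score(n) for n in [4, 4, 4, 4, 4, 3]))  # 47
```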

On four successive days, the different forms of the test were given
to the forty-two pupils in Grade IX of the Lincoln School. From these
twenty paragraphs, six were taken as constituting the most nearly 
adequate measure of variations in ability. Fewer than six paragraphs 
would not, it was believed, be sufficient for a reliable measure; more 
than six would be so many as to introduce the factor of fatigue. This 
set of six paragraphs is hereafter referred to as Form X. 

To prevent a knowledge of the number of sub-topics in the first 
test from influencing judgments in succeeding tests, this element only 
was tested on the first day, for all forms of the test. 

The total possible score for Form X was forty-seven. The range of
actual scores was from seven to thirty-five. The correlation of Form
X with the criterion was .865 ± .026.

A second form, Y, was secured by a selection of six paragraphs 
from the remaining fourteen. There was a correlation of .865 between 
Forms X and Y. 

The following year, after careful comparison of the responses and 
scores of the separate paragraphs in X and Y, alterations were made 
in four of the paragraphs (two in each form), and the tests in their 
altered forms were given to Grade IX in the Lincoln School. These 
final forms of the test are called Alpha and Beta. The range of scores
made in Form Alpha was from five to thirty-eight; in Beta, from three
to thirty-nine.


EQUIVALENCE AND RELIABILITY OF FORMS A AND B: TOPICAL OUTLINE TEST


Form      N       SD             Mean           r      PE r
Alpha     4[?]    3[?]           20.8           .90    .018
Beta      46      [illegible]    [illegible]


VALIDITY


Form Alpha has a correlation of .867 with the criterion, with a 
probable error of .026; form Beta, with the criterion .743, with a prob- 
able error of .043. 


GRADE NORMS


The grade norms are tentative only. They are based on four 
hundred cases, taken about the sixth month of the school year. The 
ninth grades of twenty high schools in seventeen towns and cities in 
the East, Middle West, and South were given the test. These were 
taken as constituting a random sample. Because of the time required 
and certain other difficulties of administration, it was not convenient 
to have each class take all forms of the test. The forms were therefore 
divided in approximately equal numbers among the members of each 
class, in the belief that the numbers taking the separate forms were 
sufficiently large to justify the assumption that the groups taking the 
different forms were approximately equal in ability, at least for all 
practical purposes. 


                        MEAN
[Form Alpha] .......... 18.9
[Form Beta] ........... 17.9


TIME REQUIRED FOR THE TEST


This is not a time test. It requires about thirty-five minutes to 
administer it. 

The scoring is at the rate of thirty to forty papers an hour, depend- 
ing on the skill of the scorer. 


SUMMARY AND CONCLUSIONS 


In the construction of an objective test for measuring ability to 
make topical outlines, one hundred twenty outlines made by forty 
Grade IX pupils were taken as the criterion. With the outlines made
by one hundred ten students for a series of twenty-eight paragraphs of 
varying difficulty as basis for devising an objective method of scoring, 
a multiple-choice test, with five possible responses for the different 
elements of the test, was constructed. The score is a measure of the 
pupil’s ability: (1) to select the central idea of the paragraph; (2) to 
determine the number of important supporting details; (3) their 
number being known, to select the correct supporting details.

The multiple choice test was constructed for a series of twenty
paragraphs, which was given to forty-two Grade IX pupils. From 
the twenty paragraphs, there were chosen six which had a satisfactory 
correlation with the criterion, and gave a satisfactory range of difficulty 
(Form X). The values assigned to the different elements of the test 
for the composite score of each paragraph were determined by trying 
different series of weights and correlating them with the criterion. 
The satisfactory weights were as follows: For topic sentence, three; for 
correct number of important sub-topics, one; for each correct sub-topic, 
one. A duplicate form, Y, was secured by a selection of six from the 
remaining fourteen paragraphs. The correlation between X and Y 
was .865. 

By altering several paragraphs in X and Y, Forms Alpha and Beta
were secured, which when given to forty-six pupils in Grade IX were
found to have a correlation of .867 ± .026 and .743 ± .043, respectively,
with the criterion. The correlation between Forms Alpha and
Beta is .900 ± .018.

The correlation chart shows that pupils of less than average ability
are less accurately measured by the objective scale than are the better
pupils.

It is suggested that the test may serve the following purposes:

1. Give an objective measure of ability to make topical outlines.

2. Measure progress in outlining.

3. Indicate to the teacher wherein pupils need training, whether in
determining the central idea of paragraphs, or in isolating details, or in
both.





THE RELATIONSHIP BETWEEN THE TECHNIQUES 
OF PARTIAL CORRELATION AND PATH 
COEFFICIENTS 


E. LOWELL KELLY 


Colorado State Teachers College¹


In an article in this journal,² Garrett presented a simplified and
much shorter method of solving multiple correlation problems, the
equations given being modifications of those of Tolley and Ezekiel.³
The writer immediately noted a striking similarity between the equa- 
tions presented by Garrett for finding regression equation coefficients 
and those involved in the techniques of path coefficients and coeffi- 
cients of determination, which he had been using in a recent study of 
the relative influence of some of the factors determining school achieve- 
ment and grade location. 

As far as is known, one of the first investigators to apply the tech- 
niques of path coefficients and coefficients of determination in the field 
of educational psychology was Doctor J. D. Heilman of Colorado State 
Teachers College, whose study is reported in Part II of the ‘‘ Twenty- 
seventh Yearbook of the National Society for the Study of Education.” 
In this report, Heilman points out the fact that, for the problem at 
hand, the path coefficients found had the same numerical value as the 
regression equation coefficients when the regression equation was 
written in the special form. It is the purpose of this paper to prove 
that they are identical for all problems, as was suggested by Wright in 
his original articles on these techniques, and also to point out shorter 
procedures. This, however, will necessitate a brief account of the 
path coefficient technique. For a fuller description of the method and 
for the derivation of the formulae used, the reader is referred to the 
original treatment of the subject by Sewall Wright.⁴


1 Now at Stanford University. 

2 Garrett, Henry E.: A Modification of Tolley and Ezekiel’s Method of Han- 
dling Multiple Correlation Problems. Journal of Educational Psychology, Vol. 
XIX, No. 1, January, 1928, pp. 45-49. 

3 Tolley, H. R. and Ezekiel, M. J. B.: A Method of Handling Multiple Correla-
tion Problems. Journal of the American Statistical Association, Vol. XVIII,
December, 1923, pp. 993-1003.

4 Wright, Sewall: Correlation and Causation. Journal of Agricultural Research,
Vol. XX, No. 7, January, 1921, pp. 557-585.

Also Wright, Sewall: Systems of Mating. I. The Biometric Relations between
Parents and Offspring. Genetics, Vol. VI, No. 3, March, 1921, p. 111.


Suppose we have four variables, $X_1$, $X_2$, $X_3$, and $X_4$, all intercorre-
lated as shown in Fig. 1. Let $X_1$ be the dependent variable or criterion,
and $X_2$, $X_3$, and $X_4$ the independent variables, all of which have a
direct influence on $X_1$ as shown by the paths a, b, and c, and indirect
influences by the way of chains of paths such as fc, db, da, and dfc.
The independent variables also have direct and indirect influences on
each other. The ordinary approach to the solution of such a problem
is by means of the partial correlation and regression equation tech-
niques, by which we are enabled to express the relation existing between
$X_1$ and the independent variables. The technique of coefficients of
determination allows us to express the amount of influence each
variable has on the other in terms of per cent and also has the advan-
tage of giving the amount of the combined influence that any two
variables such as $X_2$ and $X_3$ may have independent of their direct
influences.

Wright defines the path coefficient as the ratio of the standard
deviation of one variable due to another variable to the total standard
deviation of the first variable. Assuming that the direct influence
along any one of the paths may be measured by the amount of varia-
tion still existing in $X_1$ after the effect of all other possible paths of
influence is eliminated, we may say that the value of (a) is equal to the
ratio of the amount of variation in $X_1$ due to $X_2$ to the total amount
of variation in $X_1$. Using the standard deviation as the measure of
variation, and representing the path coefficient (a) by $p_{X_1.X_2}$, we get
the equation:

$$a = p_{X_1.X_2} = \frac{\sigma_{X_1.X_2}}{\sigma_{X_1}},$$

the general equation being $p_{X.A} = \sigma_{X.A}/\sigma_X$. In the
case of a two variable problem, the path coefficient is equal to the
coefficient of correlation existing between the two variables, since all
the measured variation in $X_1$ is due to $X_2$.

Wright shows that for more than two variables, the correlation 
between any two of them is equal to the sum of the direct path con- 
necting them, plus the products of all the combinations of paths by 
which they are indirectly connected. Using this principle, it becomes 
possible to write the following equations for the solution of such a 
problem as the one represented in Fig. 1: 


(1) $r_{23} = d$

(2) $r_{34} = e + df$

(3) $r_{24} = f + de$

(4) $r_{12} = a + db + fc + dec$

(5) $r_{13} = b + da + ec + dfc$

(6) $r_{14} = c + af + be + ade + bdf$

Substituting the values of r which are at hand, and solving the
equations simultaneously, the values of the path coefficients are
quickly and easily found.

The coefficient of determination,¹ d, which measures the fraction of
complete determination which one variable exerts upon another, is
then obtained directly from the path coefficients. Thus the fractional
amount of influence that $X_2$ exerts on $X_1$ is obtained by squaring (a);
the fractional amount that one of $X_2$ and $X_4$ exerts on the other is
obtained by squaring (f), and so forth, the general formula being
$d_{X.A} = p^2_{X.A}$. The fractional
amount of the combined influence of any two independent variables is
equal to two times the product of the direct paths connecting them
with the criterion times the coefficient of correlation between them,
the general formula being $d_{X.AB} = 2p_{X.A}p_{X.B}r_{AB}$. The fact that the
sum of the $d_{X.A}$ and $d_{X.AB}$ is equal to the coefficient of multiple
correlation squared, or that the amount of influence left unmeasured, which
is represented by $d_{X.Y}$, equals $(1 - R^2)$, is proof per se of the validity
of this technique.

Fig. 1. [Path diagram of $X_1$, $X_2$, $X_3$, and $X_4$ with the paths a to f; not reproduced.]

1 Wright: "Correlation and Causation." Op. cit., p. 562.
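The step from the coefficients of determination to the multiple coefficient may be written out for the four variable problem. What follows is a sketch under two stated assumptions: that the squared multiple coefficient is taken as the sum of the three direct and the three joint determinations, and that a, b, and c satisfy the correlation equations for $r_{12}$, $r_{13}$, and $r_{14}$, which, when $r_{23}$, $r_{24}$, and $r_{34}$ are written for their path expressions, read $r_{12} = a + r_{23}b + r_{24}c$, $r_{13} = r_{23}a + b + r_{34}c$, and $r_{14} = r_{24}a + r_{34}b + c$.

```latex
\begin{aligned}
R^2_{1(234)} &= a^2 + b^2 + c^2 + 2ab\,r_{23} + 2ac\,r_{24} + 2bc\,r_{34} \\
             &= a\,(a + r_{23}b + r_{24}c) + b\,(r_{23}a + b + r_{34}c) + c\,(r_{24}a + r_{34}b + c) \\
             &= a\,r_{12} + b\,r_{13} + c\,r_{14}
\end{aligned}
```

On these assumptions the sum of the determinations reduces to the sum of each direct path multiplied by the corresponding correlation with the criterion.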

This method of handling multiple correlation problems is much 
shorter and yields more valuable results than the ordinary method of 
partial correlation and the regression equation. Its one bad feature is 
the difficulty of including all and only the proper paths of influence. 
Writing the necessary equations becomes a very difficult procedure 
when more than four variables are concerned. By proving the identity 
of the equations given by Garrett, and those written for the solution of 
the problem in Fig. 1, the process of writing the equations from the 
diagram, or even without it, would become a relatively simple matter. 
Garrett gives the following equations¹ for a four variable problem such
as the one under consideration:


(7) $r_{12} = R_{12}B_{12} + r_{23}R_{13}B_{13} + r_{24}R_{14}B_{14}$
(8) $r_{13} = r_{23}R_{12}B_{12} + R_{13}B_{13} + r_{34}R_{14}B_{14}$
(9) $r_{14} = r_{24}R_{12}B_{12} + r_{34}R_{13}B_{13} + R_{14}B_{14}$


in which $R_{12} = \sigma_2/\sigma_1$, $R_{13} = \sigma_3/\sigma_1$, and $R_{14} = \sigma_4/\sigma_1$; and
$B_{12} = b_{12.34}$, $B_{13} = b_{13.24}$, and $B_{14} = b_{14.23}$ of the regular
regression equation. Since the
regression equation coefficients of the special form of the regression
equation are the same as those for the regular form except that the
sigmas are equalized, the expressions $R_{12}B_{12}$, $R_{13}B_{13}$, and $R_{14}B_{14}$
represent the regression equation coefficients of the special form of the
equation. This is true because equalization of the sigmas makes
$R_{12}$, $R_{13}$, and $R_{14}$ all equal.

1 Garrett: Op. cit., p. 48.

Assuming that the path coefficients and these regression equation
coefficients are identical, we may let $a = R_{12}B_{12}$; $b = R_{13}B_{13}$; and
$c = R_{14}B_{14}$ (Fig. 1). Substituting a, b, and c in equation (7), we have


(10) $r_{12} = a + r_{23}b + r_{24}c$


but from equations (1) and (3) we know that $r_{23} = d$, and $r_{24} = f + de$.
Substituting these values in equation (10), it becomes


$r_{12} = a + db + (f + de)c$; or $r_{12} = a + db + fc + dec$


which is exactly the same as equation (4) already given. The identity
of the other two equations may be proved in the same manner, and we
are forced to conclude that the path coefficients and the regression
equation coefficients for the special form of the regression equation are
the same since they are found by the solution of identical formulae.
This might be inferred from a careful reading of Garrett's definition
of the special form of the regression equation: " . . . enables us to
determine (from correlation alone) the relative weight with which each
independent factor 'enters into' or contributes to the dependent
variable (the criterion) independently of other factors."¹

Knowing that the equations given by Garrett may be used for find-
ing path coefficients as well as regression equation coefficients, we are
enabled to write the necessary equations for solving a problem by the
path coefficient method in a much shorter time, and also with greater
certainty that the set is complete. Garrett's formulae as given in the
above-mentioned article are directly applicable for any number of variables,
providing the path connecting $X_1$ and $X_2$ is always substituted for
$R_{12}B_{12}$; the path connecting $X_1$ and $X_3$ for $R_{13}B_{13}$, and so forth. When
no relationship exists between any two variables, the correlation
between them, as well as the path connecting them, is zero, and the
formulae are still applicable. In case one desires to find the paths
which do not connect directly with the criterion, it is only necessary to
substitute one of the independent variables as a temporary criterion,
renumber the others, and then use the same formulae as before. It
must be remembered, however, that correlation between any two
variables does not necessarily presuppose a causal relationship of one
on the other. The direction of influence of the various paths must
therefore be assumed on a priori grounds or else determined by other
means.
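The procedure may be illustrated by a minimal sketch for the four variable problem. The correlations used are hypothetical and serve only to show the arithmetic; the function names are likewise assumptions and belong to no published routine. The three equations for the direct paths are solved by Cramer's rule, the remaining paths d, e, and f are found from equations (1) to (3), and the chain-of-paths equations (4) to (6) may then be checked against the original correlations.

```python
def det3(m):
    """Determinant of a 3 x 3 matrix given as a list of rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve3(m, v):
    """Solve m x = v for three unknowns by Cramer's rule."""
    d = det3(m)
    out = []
    for k in range(3):
        mk = [row[:] for row in m]
        for i in range(3):
            mk[i][k] = v[i]          # replace column k by the right-hand side
        out.append(det3(mk) / d)
    return out


def path_coefficients(r12, r13, r14, r23, r24, r34):
    """Paths a, b, c from equations (7)-(9); d, e, f from equations (1)-(3)."""
    a, b, c = solve3([[1, r23, r24],
                      [r23, 1, r34],
                      [r24, r34, 1]], [r12, r13, r14])
    d = r23
    e = (r34 - d * r24) / (1 - d * d)
    f = (r24 - d * r34) / (1 - d * d)
    return a, b, c, d, e, f


# Hypothetical zero order correlations, for illustration only.
r12, r13, r14, r23, r24, r34 = 0.6, 0.5, 0.4, 0.3, 0.2, 0.1
a, b, c, d, e, f = path_coefficients(r12, r13, r14, r23, r24, r34)

# Equation (4) written as a chain of paths should return r12.
print(abs(a + d * b + f * c + d * e * c - r12) < 1e-12)  # True
```

Cramer's rule is used here only to keep the sketch self-contained; for more than four variables any ordinary method of solving simultaneous equations would serve.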


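Garrett also gave the following formula for calculation of the coeffi-
cient of multiple correlation:


$$R^2_{1(234 \cdots n)} = \frac{B_{12}r_{12}\sigma_1\sigma_2 + B_{13}r_{13}\sigma_1\sigma_3 + B_{14}r_{14}\sigma_1\sigma_4 + \cdots}{\sigma_1^2}$$


However, since $R_{12} = \sigma_2/\sigma_1$, $R_{13} = \sigma_3/\sigma_1$, and $R_{14} = \sigma_4/\sigma_1$, we may substitute,
obtaining

$$R^2_{1(234 \cdots n)} = R_{12}B_{12}r_{12} + R_{13}B_{13}r_{13} + R_{14}B_{14}r_{14} + \cdots$$

but $R_{12}B_{12} = a$; $R_{13}B_{13} = b$; and $R_{14}B_{14} = c$, and so forth, so

$$R^2_{1(234 \cdots n)} = ar_{12} + br_{13} + cr_{14} + \cdots$$

the terms in each case being continued through the nth variable.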





1 Garrett, Henry E.: Statistics in Psychology and Education. Longmans, 
Green and Company, New York, 1926, p. 256.

In other words, the coefficient of multiple correlation is equal to
the square root of the sum of the products obtained by multiplying each
of the path coefficients connecting the criterion and an independent
variable by the coefficient of correlation between the same variables.
This greatly reduces the labor of finding the multiple coefficient, since
the values of the various sigmas need not be taken into consideration.
The standard error of estimate may then be obtained by substituting
in the formula given by Garrett:


$$\sigma_{\mathrm{est.}X_1} = \sqrt{\sigma_1^2\left(1 - R^2_{1(234 \cdots n)}\right)}$$


Due to the fact that only two simultaneous equations are involved
in the solution of a three variable problem, it is possible to derive simple
formulae for the two path coefficients and the coefficient of multiple
correlation involving only the zero order coefficients of correlation.
By an algebraic solution of the two equations, we get:

$$a = \frac{r_{12} - r_{13}r_{23}}{1 - r_{23}^2}$$

and

$$b = \frac{r_{13} - r_{12}r_{23}}{1 - r_{23}^2}$$

Substituting the above values for a and b in the equation

$$R_{1(23)} = \sqrt{ar_{12} + br_{13}}$$

and reducing to lowest terms gives the following formula for the
multiple coefficient for a three variable problem:

$$R_{1(23)} = \sqrt{\frac{r_{12}^2 + r_{13}^2 - 2r_{12}r_{13}r_{23}}{1 - r_{23}^2}}$$








This formula will be found valuable, especially if the path coeffi- 
cients are not wanted, since it enables one to calculate the multiple 
coefficient directly from the zero order coefficients. Since it is the 
same as the equation resulting from the solution of a three variable 
problem by means of determinants, we have additional proof of the 
identity of the values found by the two techniques under discussion. 
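The three variable formulae may be tried numerically with the short sketch below. The correlations are hypothetical and the function name is an assumption; the only point illustrated is that the multiple coefficient found directly from the zero order correlations agrees with the one found from the two path coefficients.

```python
from math import sqrt


def three_variable(r12, r13, r23):
    """Path coefficients a, b and multiple R for a criterion and two variables."""
    denom = 1 - r23 ** 2
    a = (r12 - r13 * r23) / denom
    b = (r13 - r12 * r23) / denom
    big_r = sqrt((r12 ** 2 + r13 ** 2 - 2 * r12 * r13 * r23) / denom)
    return a, b, big_r


# Hypothetical zero order correlations, for illustration only.
r12, r13, r23 = 0.6, 0.5, 0.3
a, b, big_r = three_variable(r12, r13, r23)

# R squared from the paths should equal R squared from the direct formula.
print(abs((a * r12 + b * r13) - big_r ** 2) < 1e-12)  # True
```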





“MYSTERIOUS” TROPISMS: AN ILLUSTRATION OF 
THE GESTALT LAW OF PRECISION 


MARION E. GRANT 


Baylor College, Belton, Texas 


Members of the Gestalt school of psychologists state as one of the
laws of configuration that configuration patterns "constantly vary in
the direction of greater simplicity, stability and precision."¹ The
following account records an experience which seems to bear out this
theory when the configuration is a definition containing a word whose
meaning is relatively unfamiliar.

During a lecture at the beginning of a course in Educational
Psychology, the writer stated a definition of the word "tropism,"
which began: "A tropism is a stereotyped form of behavior . . . "
Assuming that her class was of average intelligence and verbal experi-
ence (a dangerous assumption on the part of any teacher!), no effort
was made to explain the meaning of the word, stereotyped. A brief
test was given to the class about three days later and one question
called for the definition of a tropism. What was the surprise and dis-
may of the examiner to find on one of the first papers graded the
statement: "A tropism is a mysterious type of behaviorism;" and, on
another: "A tropism is an exterior form of behavior." Other papers
revealed still other interesting and amusing modifications of the
original definition.

In order to determine what further modifications might take place, 
at the next regular meeting of the class three days later, the students 
were called upon to write again a definition of "tropism" and to state
whether or not they had looked up or discussed the definition in the 
meantime. The original definition was then re-stated, and the mem- 
bers of the class were asked to refer to their previous notes and to write 
down the definition as they had first taken it down in their notes. 
In this way, three forms of data were secured: (1) The original inter- 
pretation of the definition as taken in class notes; (2) the re-statement 
of it on the test; (3) the re-statement of it after a further interval of 
three days. 

The following table shows the statements obtained from ten of the 
pupils, the data being in the order stated above: 





1 Ogden, R. M.: "Psychology and Education," p. 238.


I. (1) Some form of behavior on the outside. 
(2) The behavior on the outside. 
(3) The outward behavior of the lower organisms. 
II. (1) Some exterior behavior. 
(2) A form of exterior behavior to certain types of stimuli.
(3) A form of outward behavior. 
III. (1) The mysterious type of behaviorism. 
(2) The mysterious type of behavior. 
(3) The mysterious type of life. 
IV.¹ (1) Some superior response of behavior.
(2) A permanent change in outer behavior. 
(3) A permanent change in outward behavior. 
V. (1) Some certain form or type of behavior. 
(2) A certain or particular type of habit. 
(3) A special or certain type of habit. 
VI. (1) Some form of outward behavior due to some stimuli. 
(2) The behavior of the lower organisms. 
(3) The outward behavior of certain lower organisms. 
VII. (1) Some exterior form of behavior. 
(2) Outside reaction. 
(3) An outside reaction. 
VIII. (1) A mysterious reaction. 
(2) The mysterious outer reaction of some lower animals. 
(3) The mysterious outer reaction of some animals. 
IX. (1) A certain kind of behavior. 
(2) A type of habit.
(3) Particular type of habit. 
X. (1) Some exterior form of behavior. 
(2) An external response to a sensation or stimuli. 
(3) The external response to a stimulus. 


Of the students whose answers are not recorded, many were quite 
familiar with the word, and recorded the definition verbatim. For 
them the configuration was already relatively simple and stable. 
Some others avoided the long word, stereotyped, but translated it as
definite, certain, or fixed, again a change in the direction of greater
simplicity.

1 Student No. IV indicated that, on conversation with another student, she
took her definition, "exterior form of behavior," but this is decidedly modified in
(2) and (3). (Seemingly, too, it was a case of taking more than her friend could
spare, since the latter failed to reproduce any definition whatever!)


CONCLUSIONS


1. That the Gestalt law of precision is illustrated by the above
examples is shown in the changes which took place in the form of the
definition. An attempt was made to put it in terms which were
already familiar in the student's experience, and therefore more stable
and simple.

2. Judging from the experience recounted above, it is an unwar- 
ranted presumption on the part of a college teacher to expect more than 
sixty-five per cent of a class of sophomores to have a sufficiently large 
vocabulary to include the word stereotyped! 

3. This experiment indicates the necessity of couching definitions
in simple terms and of insisting on very frequent drill and review on
material to be mastered. It is, I fear, typical of the half-knowledge
(or worse) that many of our students carry away with them after an
hour of supposedly lucid and educative lecturing.





THE EFFECT OF THE ORDER OF PRINTED RESPONSE
WORDS ON AN INTEREST QUESTIONNAIRE


C. O. MATHEWS


Ohio Wesleyan University 


The writer recently¹ called attention to the fact that the order of
printed response words is one of the influencing factors upon the 
answers obtained on tests with alternate responses. A response word 
when printed above its alternative was marked 33.8 per cent more often 
than when it was printed below its alternative. A response word when 
printed to the left of its alternative was marked 3.2 per cent more often 
than when it was printed to the right of its alternative. 

To what extent will the answers on a rating blank with five possible 
responses be influenced by the order in which the descriptive words are 
printed? To secure evidence on this question was the purpose of this 
investigation. 

A questionnaire of fifty items was devised to discover the attitudes 
of pupils toward various study habits. It was printed in two forms. 


The directions and a few items of Form A are given to illustrate the 
arrangement. 


Directions: Fifty phrases are given below. Each one has something to do
with methods of study. You are to indicate how well you like each one by under-
lining the appropriate answer under each phrase, like this:

A. To go on class excursions.
like greatly    like    indifferent    dislike    dislike greatly

Mark only one answer for each item. "Indifferent" means that you neither
like nor dislike the thing. You know what the other words mean. Do not skip
any items.


1. To look up references on your lessons.
like greatly    like    indifferent    dislike    dislike greatly
4. To make an outline of your lesson.
like greatly    like    indifferent    dislike    dislike greatly
7. To take notes during a talk or recitation.
like greatly    like    indifferent    dislike    dislike greatly
32. To know your grade on a test.
like greatly    like    indifferent    dislike    dislike greatly
50. To have your teachers tell you how to study.
like greatly    like    indifferent    dislike    dislike greatly





1 Journal of Educational Psychology, Vol. XVIII, Oct., 1927, pp. 445-457.

Form B was identical to Form A in every respect except that the
response words following each item were printed in the reverse order, 
that is: 


dislike greatly dislike indifferent like like greatly 


That the opinions concerning the degree of interest expressed by 
the pupils on the items of these blanks is not due wholly to chance is 
shown by the reliability coefficients. Reliabilities were computed after 
numerical values were assigned to the degrees of interest marked. 
The values used were: 


like greatly........ +2        dislike greatly....... -2
like................ +1        dislike............... -1
indifferent.........  0


This does not assume, of course, that it is any more desirable to like 
any activity than it is to dislike it. Coefficients were first computed 
by comparing the scores on the two halves of a blank and then Brown’s 
formula was used to estimate the reliability of the complete blank. 
That of Form A is .75 and of Form B .79 when computed from the 
responses of the one hundred thirty college students mentioned later. 
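
The step from half-blank to whole-blank reliability can be sketched as follows. This is only an illustration and rests on two assumptions: that "Brown's formula" is the familiar Spearman-Brown formula for a test of doubled length, and that the half-blank correlations, which are not reported above, may be recovered by inverting it. The function names are hypothetical.

```python
def spearman_brown(r_half: float, factor: float = 2.0) -> float:
    """Estimated reliability of a blank `factor` times as long as the part
    whose reliability (or half-to-half correlation) is r_half."""
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


def implied_half_correlation(r_full: float) -> float:
    """Invert the doubled-length formula: the correlation between halves
    that would yield a whole-blank reliability of r_full."""
    return r_full / (2.0 - r_full)


# Whole-blank reliabilities as reported in the text for Forms A and B.
for form, r_full in (("A", 0.75), ("B", 0.79)):
    r_half = implied_half_correlation(r_full)
    print(f"Form {form}: implied half-blank r = {r_half:.2f}, "
          f"stepped up again = {spearman_brown(r_half):.2f}")
```

Under these assumptions a whole-blank reliability of .75 corresponds to a correlation of .60 between the two halves.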

The blanks were first given to one hundred eighty-four Junior High 
School girls in a large New York City school as a part of another testing 
program in progress in the school. The blanks were so distributed 
that half of the girls marked Form A first and the other half marked 
Form B first. Immediately afterwards Form B was distributed to 
those who had marked A, and Form A distributed to those who had 
marked B. In passing out the second form it was explained that this 
test was somewhat like the former one but not the same test. That 
many of these pupils did not detect the difference is shown by the fact 
that many were quite positive that the test was the same, calling the 
examiner’s attention to the fact that he was passing out the wrong test. They 
were urged to omit no items, but to decide how much they liked or 
disliked each activity or procedure mentioned. In spite of this 
admonition there were thirteen omissions on Form A and seven on 
Form B. This accounts for the total number of responses for each 
form as given in Table I, instead of 9200, as we otherwise would expect. 
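
The arithmetic behind these totals is simple enough to check directly. The short sketch below uses only the figures stated in the text; the variable names are ours.

```python
PUPILS = 184                      # Junior High School girls
ITEMS = 50                        # items on each form of the blank
OMISSIONS = {"A": 13, "B": 7}     # items left unmarked on each form

expected = PUPILS * ITEMS         # responses if no item had been skipped
totals = {form: expected - skipped for form, skipped in OMISSIONS.items()}

print(expected)   # 9200
print(totals)     # {'A': 9187, 'B': 9193}
```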

Table I shows the differences between the percentages of similar 
responses on Forms A and B, made by one hundred eighty-four Junior 
High School girls. For example, there were 2990 “Like greatly” 
responses on Form A. This was 32.5 per cent of the total number of 
responses made by these girls on this form. There were 2717 such 
responses on Form B, which is 29.6 per cent of all Form B responses. The 
difference between these two per cents is 2.9 in favor of Form A. The 
probable error of this difference is .8 and the ratio of the difference to 
its probable error is 3.6. 
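
The computation just described can be repeated for every type of response. The sketch below takes the counts and the probable errors as printed in Table I; the probable errors are copied, not recomputed, since the formula by which they were obtained is not stated here. The function name is hypothetical.

```python
# Count on Form A, count on Form B, and probable error (PE) of the
# difference between per cents, all as printed in Table I.
TABLE_I = {
    "Like greatly":    (2990, 2717, 0.8),
    "Like":            (2348, 2604, 0.8),
    "Indifferent":     (1748, 1685, 0.8),
    "Dislike":         (1425, 1288, 0.9),
    "Dislike greatly": (676, 899, 0.9),
}
TOTAL_A = 9187   # all responses on Form A
TOTAL_B = 9193   # all responses on Form B


def summarize(count_a, count_b, pe):
    """Per cents on each form, their difference (A minus B), and the
    ratio of the absolute difference to its probable error."""
    pct_a = round(100.0 * count_a / TOTAL_A, 1)
    pct_b = round(100.0 * count_b / TOTAL_B, 1)
    diff = round(pct_a - pct_b, 1)
    ratio = round(abs(diff) / pe, 1)
    return pct_a, pct_b, diff, ratio


for response, row in TABLE_I.items():
    pct_a, pct_b, diff, ratio = summarize(*row)
    print(f"{response:16s} A={pct_a:5.1f}  B={pct_b:5.1f}  "
          f"diff={diff:+5.1f}  diff/PE={ratio:4.1f}")
```

A plus difference means the larger per cent occurred on Form A, a minus difference that it occurred on Form B.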


TABLE I. THE DIFFERENCES BETWEEN PERCENTAGES OF SIMILAR RESPONSES ON
FORMS A AND B MADE BY ONE HUNDRED EIGHTY-FOUR JUNIOR HIGH
SCHOOL GIRLS

                    | Responses on   | Responses on   | Difference    | PE of      | Difference
                    | Form A         | Form B         | between per   | difference | divided by
Type of response    | Number | Per   | Number | Per   | cents on      |            | PE of
                    |        | cent  |        | cent  | Forms A and B |            | difference
Like greatly....... |  2990  | 32.5  |  2717  | 29.6  |    +2.9¹      |     .8     |    3.6
Like............... |  2348  | 25.6  |  2604  | 28.3  |    -2.7       |     .8     |    3.4
Indifferent........ |  1748  | 19.0  |  1685  | 18.3  |    + .7       |     .8     |     .9
Dislike............ |  1425  | 15.5  |  1288  | 14.0  |    +1.5       |     .9     |    1.7
Dislike greatly.... |   676  |  7.4  |   899  |  9.8  |    -2.4       |     .9     |    2.7
All responses...... |  9187  | 100   |  9193  | 100   |               |            |











¹ These differences are plus when the larger per cent occurs on Form A and 
minus when it is on Form B. 


From this table it can be seen that “Like greatly” was marked 2.9 
per cent more often when it was printed at the extreme left than when 
printed at the extreme right. “Dislike greatly” was marked 2.4 per 
cent more often when it was printed at the extreme left. “Like” and 
“Dislike,” however, both were marked more often when they occurred 
in the fourth position from the left than when in the second position 
from the left. The statistical significance of these differences is shown 
by the ratio of the differences to their probable errors. 

As a check on the findings shown above both forms of the blanks 
were given to one hundred thirty college students. The procedure in 
administering the blanks to these students differed in several respects 
from that used with the Junior High School group. 

The blanks were given during regular class periods and the one 
hundred thirty students were in four different classes. Form A was 
given first to all groups and Form B followed in exactly one week. 
The differences between the percentages of similar responses on the 
two forms were computed in the same way as they had been for the 
Junior High School group. Table II shows these facts for the one 
hundred thirty college students. 
TABLE II. DIFFERENCES BETWEEN PERCENTAGES OF SIMILAR RESPONSES ON
FORMS A AND B MADE BY ONE HUNDRED THIRTY COLLEGE STUDENTS

                    | Responses on   | Responses on   | Differences   | PE of      | Difference
                    | Form A         | Form B         | between per   | difference | divided by
Type of response    | Number | Per   | Number | Per   | cents on      |            | PE of
                    |        | cent  |        | cent  | Forms A and B |            | difference
Like greatly....... |  1568  | 24.3  |  1113  | 17.1  |    +7.2       |    1.0     |    7.2
Like............... |  2030  | 31.1  |  2338  | 36.0  |    -4.9       |     .9     |    5.4
Indifferent........ |  1149  | 17.7  |  1217  | 18.8  |    -1.1       | [illegible]| [illegible]
Dislike............ |  1292  | 19.8  |  1283  | 19.7  |    + .1       |    1.0     |     .1
Dislike greatly.... |   459  |  7.1  |   544  |  8.4  |    -1.3       |    1.1     |    1.2
All responses...... |  6498  | 100   |  6495  | 100   |               |            |

The signs of the differences between Forms A and B are the same as 
they were in the results from junior high school pupils shown in Table 
I, with the exception of that of “Indifferent.” Only two of these 
differences, however, are statistically significant. These same two 
differences are also the most reliable ones in Table I. 

The most evident conclusion from the results shown in these two 
tables is that the effect of printed position is greatest and most reliable 
upon “Like greatly” and “Like” responses. The largest per cent of 
“Like greatly” responses occurred on Form A in which this response 
was printed at the extreme left position. The largest per cent of 
“Like” responses occurred in Form B in which this response was 
printed in the fourth position from the left. 

Similarly it is shown that a greater number of “Dislike greatly” 
responses were marked when it was printed at the extreme left position 
than when at the extreme right. More “Dislike” responses were 
marked when it was printed in the fourth position. These facts hold 
true for each of the two groups of students to whom the tests were 
given. 




What are the causes underlying such effects which seem to be pro- 
duced by printing a response word in one position in an order rather 
than in another position? Reading habits probably have an influence. 
In dealing with items in which there is no word or symbol of pro- 
nounced potency, more chance factors are probably at work in causing 
changes from test to test than in items on which one has very definite 
opinions. 
Suppose, for example, we were to include such an item as “How 
would you like to commit murder?” It is inconceivable that any 
normal person would be influenced very much in his response by the 
fact that “Dislike greatly” came fifth instead of first in a series of 
possible responses. 

Among our items, for example, was one, “To study in a well ventilated 
room.” Every one of our one hundred thirty college students 
marked the response to this item “Like greatly” or “Like.” Eighty-one 
of them gave the former response and twenty-two the latter on 
both forms of the blank. The remaining twenty-seven students 
changed from one of these responses to the other on the second form. 
There were fewer changes on this item than on any other one. The 
exact changes can be seen from the data in Table III. 


TABLE III. COMPARISON OF PUPILS’ RESPONSES ON FORMS A AND B TO THE
ITEM “TO STUDY IN A WELL VENTILATED ROOM”

                    |                  Form B
Form A              | Like greatly | Like | Indifferent | Total
Like............... |       8      |  21  |             |   29
Like greatly....... |  [illegible] |      |             |
Total.............. |      89      |  40  |      1      |  130



With items on which individuals do not have such pronounced 
views, one would expect a greater variation in the responses of various 
individuals and also more changes in responses when such an item is 
presented a second time. In such items the position of printed 
response words probably would have a greater influence upon the 
answers indicated. For example, the most variable item on this 
questionnaire was, “To look up references on your lessons.” Table 
IV shows the number and degree of changes made by the one hundred 
thirty college students to this item. 

Such facts show how variable the changes are. Further evidence 
that these changes are somewhat due to the effect of printed position 
of responses may be obtained by comparing the number of times each 
response was marked in various positions. “Indifferent” held the 
same relative position in the printed responses on both forms of the 
test. In the Junior High School group a few more of these responses 
were marked on Form A and with the college group a few more on 
Form B. 
TABLE IV. COMPARISON OF PUPILS’ RESPONSES ON FORMS A AND B TO THE ITEM
“TO LOOK UP REFERENCES ON YOUR LESSONS”

                    |                        Form B
Form A              | Like    | Like | Indifferent | Dislike | Dislike | Total
                    | greatly |      |             |         | greatly |
Dislike greatly.... |         |      |             |         |         |    2
Dislike............ |         |      |             |         |         |   28
Indifferent........ |         |      |             |         |         |   34
Like............... |         |      |             |         |         |   50
Like greatly....... |         |      |             |         |         |   16
Total.............. |    3    |  57  |     29      |   34    |    7    |  130

[The entries in the body of this table are illegible in this copy; only the totals are given.]











But when we compare the other four responses in different positions 
we find results as shown in Table V. These are figures taken from 
Tables I and II placed in a different arrangement. The word alternative 
is used to indicate one or the other member of each pair, “Like 
greatly” or “Like” at one end of the scale and “Dislike” or “Dislike 
greatly” at the other end. 


TABLE V. NUMBER OF TIMES EACH RESPONSE WAS MARKED IN EACH OF TWO
PRINTED POSITIONS IN RELATION TO ITS ALTERNATIVE

                      | Type of          | To left of  | To right of | Difference in
Group                 | response         | alternative | alternative | favor of left
                      |                  |             |             | position
One hundred eighty-   | Like greatly     |    2990     |    2717     |     273
four Junior High      | Like             |    2604     |    2348     |     256
School students       | Dislike          |    1425     |    1288     |     137
                      | Dislike greatly  |     899     |     676     |     223
One hundred thirty    | Like greatly     |    1568     |    1113     |     455
College students      | Like             |    2338     |    2030     |     308
                      | Dislike          |    1292     |    1283     |       9
                      | Dislike greatly  |     544     |     459     |      85

















Every response was marked more often in the first position, i.e., 
to the left of its alternative, than it was in the second position. Some 
of the differences are small and do not meet the criterion of statistical 
significance, but probably are significant in the ordinary sense, in that 
they all favor the same printed position. 
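
The comparison in Table V can be verified with a few lines of arithmetic. The sketch below takes the counts as printed in that table and confirms that each of the eight differences favors the left position; the variable names are ours, and no test of significance is attempted.

```python
# (group, response, times marked when printed to the left of its
# alternative, times marked when printed to the right), from Table V.
TABLE_V = [
    ("Junior High School", "Like greatly",    2990, 2717),
    ("Junior High School", "Like",            2604, 2348),
    ("Junior High School", "Dislike",         1425, 1288),
    ("Junior High School", "Dislike greatly",  899,  676),
    ("College",            "Like greatly",    1568, 1113),
    ("College",            "Like",            2338, 2030),
    ("College",            "Dislike",         1292, 1283),
    ("College",            "Dislike greatly",  544,  459),
]

# Difference in favor of the left position for each response and group.
differences = [left - right for _, _, left, right in TABLE_V]
favoring_left = sum(1 for d in differences if d > 0)

for (group, response, _, _), d in zip(TABLE_V, differences):
    print(f"{group:20s} {response:16s} {d:4d}")
print(f"{favoring_left} of {len(differences)} differences favor the left position")
```

The eight comparisons are not independent of one another, since the same pupils supplied all the responses within a group, so the agreement in sign is suggestive rather than a formal proof.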
To what extent such effects of position are due to reading habits or 
to a prepotent influence of the first response of a pair is not known. 
Our data show that there is great variation among individuals as to 
their tendency to be influenced by the position of printed responses. 
For this reason it would be impossible to work out a correction which 
would be equally applicable to the responses of all individuals. 

If we assumed that the per cent of individuals who marked a 
given response to an item was a true measure of the per cent who 
really preferred this response we might make quite an error. One way 
this error probably could be avoided, so far as it is caused by position 
of printed responses, would be to let the individuals indicate their 
preferences by writing a word or symbol rather than employing printed 
responses. 


SUMMARY 


It has been shown that the position of a printed response on a 
questionnaire with five possible responses influenced individuals’ 
answers to items on the blank. The amount of this influence is indi- 
cated by the fact that a given response was marked from .1 to 7.2 per 
cent more often in one than in another position. With the responses 
used in this questionnaire, position one (farthest to the left) was influ- 
ential over position two, and position four was more potent than posi- 
tion five. This influence of position probably could be avoided by 
requiring the individual to respond by writing a word or symbol 
rather than printing the responses on the blank. 





THE “SIGHT READING” METHOD OF TEACHING 
READING, AS A SOURCE OF READING 
DISABILITY 


SAMUEL T. ORTON, A.M., M.D. 
New York City 


I feel some trepidation in offering criticism in a field somewhat 
outside of that of my own endeavor but a very considerable part of 
my attention for the past four years has been given to the study of 
reading disability from the standpoint of cerebral physiology. This 
work has now extended over a comparatively large series of cases from 
many different schools and both the theory which has directed this 
work and the observations garnered therefrom seem to bear with 
sufficient directness on certain teaching methods in reading to warrant 
critical suggestions which otherwise might be considered overbold. 

I wish to emphasize at the beginning that the strictures which I 
have to offer here do not apply to the use of the sight method of teach- 
ing reading as a whole but only to its effect on a restricted group of 
children for whom, as I think we can show, this technique is not only 
not adapted but often proves an actual obstacle to reading progress, 
and moreover I believe that this group is one of considerable educa- 
tional importance both because of its size and because here faulty 
teaching methods may not only prevent the acquisition of academic 
education by children of average capacity but may also give rise to far 
reaching damage to their emotional life. 

The sight reading method (or “look and say” of the English) has 
been credited with giving much faster progress in the acquisition of 
reading facility than its precursors and this statement I will not chal- 
lenge if the measure of accomplishment be the average progress of a 
group or class. Average progress of large numerical units, however, 
makes no allowance for the study of effect in individuals, particularly 
if certain of them deviate to some degree from the others in their 
methods of acquisition and therefore in their teaching requirements. 
To the mental hygienist whose interest is focussed on the individual 
and his problems rather than on group progress the results as deter- 
mined by average accomplishment are of little value whereas the effect 
of a given method on the individual child is all important. 

Outstanding cases of so-called “congenital word blindness” (a 
complete inability to learn to read) have been recognized and studied 
for a number of years, at first chiefly by physicians. It has also been 
recognized by teachers and psychologists that there is a large group of 
children who have a much greater difficulty in getting started in read- 
ing than would be expected from their ability in arithmetic, from their 
ease in auditory acquisition and from their general alertness. In the 
past there has been a tendency, at least among medical men, and 
to a considerable degree among psychologists as well to exclude the 
minor cases of slow learning in reading from the category of congenital 
word blindness. This largely derives from the work of Hinshelwood¹ 
who made the first extensive study of these cases following the pioneer 
work of Kerr² and Morgan.³ Hinshelwood’s statement in this is 
“. . . the rapidity and ease with which children learn to read by sight 
vary a great deal. No doubt it is a comparatively common thing to 
find some who lag considerably behind their fellows, because of their 
slowness and difficulty in acquiring their visual word memories, but I 
regard these slight defects as only physiological variations and not to 
be regarded as pathological conditions. It becomes a source of con- 
fusion to apply to such cases as has been done of late the term of ‘con- 
genital word blindness’ which should be reserved for the really grave 
degrees of this defect which manifestly are the result of a pathological 
condition of the visual memory center and which have proved refractory 
to all ordinary methods of school instruction.” Unfortunately, 
Hinshelwood’s criterion is a double one, neither part of which can be 
looked upon as of sufficient diagnostic accuracy to establish a clear cut 
entity. Not only has no pathological condition of the visual memory 
center yet been substantiated in such cases but there are certain 
neurological and clinical data which suggest that no such condition 
exists. Again, the “ordinary methods of school instruction” does not 
prove to be an accurate measure. Such methods vary widely and our 
own figures indicate that the number of children who show a significant 
handicap in reading is to some degree related to the teaching method 
in use. Bachmann⁴ has called attention to the looseness of the concept 
of congenital word blindness and related to this the striking variation 
in the frequency of such cases as recorded by various authors. 
Without some fairly clear objective symptoms on which to establish 
the entity, the choice of cases to be included naturally rests on the 
judgment of the examiner as to the severity of the disability. My own 
initial work⁵ in this field led to a firm conviction that we were dealing 
here, not with two separate groups—a physiological and a pathological 
—but that those children who were specifically retarded in reading (thus 
excluding cases of general mental defect) formed a graded series extending 
from the normal to the extreme and that they showed consistent 
characteristic performance which not only would serve for diagnosis 
but which also was highly suggestive of the reason for their lack of 
progress and which gave excellent cues to methods for retraining. I 
was convinced not only that the specific reading disability formed an 
entity of much greater numerical importance than had been recognized 
before but that it was (even in the extreme cases) an obstacle of a 
physiological nature rather than a pathological condition and that 
therefore adequate special methods of teaching should correct it. 

I can not here go fully into the details of the anatomical back- 
ground for our present theory of this disability but some presentation 
is necessary in order to illustrate the basis for the criticism of teaching 
method which is here offered. 

Only a small portion of the retina of the eye is used in acquisition 
of reading. This is the focus of central vision or the macula lutea, so 
called because it is seen as a yellow spot in ophthalmoscopic examina- 
tions. The rest of the retina receives only general and less detailed 
impressions coming from outside the rather small area to which we are 
directing our attention. This point is noteworthy because the nervous 
connections of these two divisions of the retina are quite unlike. The 
peripheral retina or outer zone has connections with only one-half of 
the brain (there are some complexities here but these need not concern 
us). The macula lutea, however, which receives impressions with 
greatest detail and which is hence used exclusively in learning to read, 
has a double connection with the brain. The nerve fibers arising here 
divide and one-half of those starting from each macula goes to the visual 
area of the hemisphere of the brain of the same side and the other half to 
the corresponding area of the opposite hemisphere. Thus impressions 
received by either eye, or by both eyes, are relayed simultaneously to 
both hemispheres of the brain. This double implantation does not 
give us a double sensation in consciousness, however, as a touch on 
both thumbs would do. The simultaneous activity of both areas 
results in our seeing but a single image. The visual sensation, however, 
is not a unitary function. There is apparently need for the 
simultaneous or additive activity of several parts of the visual cerebral 
mechanisms to complete the linkage of a printed symbol with its 
meaning and the steps in this process are shown in relief by differential 
losses such as are seen when certain parts of the back of the brain are 
destroyed by disease. When all of that part of the brain which has 
to do with vision is destroyed the individual becomes totally blind. 
The eyes, however, are not damaged and they can still be moved and 
they will turn toward a sudden sound and the pupils will respond by 
closing and opening to increase and decrease of the amount of light 
which strikes them. This condition is known as cortical blindness, to 
differentiate it from blindness due to disease of the eyes or optic 
nerves. We may, however, see things surrounding us with sufficient 
clarity to avoid colliding with them, that is to guide our general body 
movements but without being able to appreciate the meaning of things 
which we see. This was first demonstrated by Munk in dogs in which 
much of this part of the brain had been removed. They were able to 
avoid collisions but did not recognize their master or even food by sight 
alone, and did not cringe from a whip. To this condition Munk gave 
the name of mindblindness and its parallel has since frequently been 
recorded in cases of disease of the human brain. Apparently at the 
first level the visual area of the brain serves as a very accurate guide to 
motion and it probably also furnishes the element of awareness of the 
external origin of a sensation (as contrasted to a memory). In psy- 
chological terms it furnishes the pure perceptual element to sensation 
but simultaneous or additive activity in other higher level visual 
areas is requisite to attach meaning and again we know that this is 
not accomplished in one step. If destruction of brain tissue happens 
in a certain area there results a condition in which the patient not only 
can see correctly but can also understand the meaning of objects seen, 
but in which the ability to read the printed or written word is entirely 
lost. That vision in the ordinary sense is normal, is shown by the fact 
that such a patient can copy printed material but cannot read either 
the original or his copy. Thus we see from these differential losses that 
the process of linking a printed word to its meaning passes through at 
least three stages of elaboration in the brain before it is completed. 

There are differences, however, in the brain destruction necessary 
to produce losses at these different elaborative levels. Destruction in 
one hemisphere only is not sufficient to produce either cortical blind- 
ness or mindblindness. At these first two levels of elaboration, that is 
in perception and recognition of the meaning of objects, apparently 
destruction must involve the areas subserving these functions in both 
hemispheres before their loss results. The two hemispheres are appar- 
ently of equal importance here as it apparently makes no difference 
which side is affected; i.e., either hemisphere is alone adequate for 
these functions. Exception must be taken to these statements in the 
case of peripheral vision but, as noted before, this is not of interest to 
us here since central vision is used exclusively in learning to read. 
When we come to the third plane of elaboration, the situation is strikingly 
different; this is the level at which the written or printed symbol 
is linked with its meaning and hence it is variously described as the 
associative, concept, or symbolic level. Here not only is damage to 
one hemisphere sufficient to destroy function but it makes a difference 
which hemisphere is affected. If the hemisphere which is known as 
the dominant happens to suffer, a complete loss of this function results 
and the patient becomes word blind. If, on the other hand, the 
damage occurs in the other hemisphere—the non-dominant—nothing 
apparently happens. So entirely without result is a destruction here 
that this area of the brain takes its place with certain others among 
those which the surgeons called the “silent areas” of the brain. Obviously, 
the visual records implanted in both halves of the brain are not 
requisite for reading. This situation also exists in the field of under- 
standing of the spoken word, and of speech and of writing. In all 
four of these functions destruction in the dominant hemisphere in the 
so-called language zone is meaningful while destruction in exactly 
similar parts of the opposite hemisphere is meaningless. 

Thus we learn to understand, to read, to speak, and to write words 
from sensory records or engrams of one hemisphere only. This fact is 
so striking that we have been prone to overlook what must happen in 
the inactive side. We believe today that the completed growth and 
development of nerve cells is largely a result of stimulation. If cells 
do not receive stimuli they do not reach their full development. The 
two sides of the brain do not show much, if any, difference in size or 
complexity and certainly no such difference as we see in function as 
outlined above. To account for equality of growth we must accept 
equality of stimulation—equal nervous irradiation of the two sides— 
and if they are equally irradiated, records must be left behind in each; 
i.e., engrams must be formed in the non-dominant as well as in the 
dominant hemisphere. To account then for the difference in effect of 
damage in the two sides we must assume that the engrams of one side 
become the controlling pattern through establishment of a physiologi- 
cal habit of use of that set and that the other set of recorded engrams is 
latent or elided. Variations in the completeness of this physiological 
selection, i.e., failure of elision of the non-dominant engrams, form the 
kernel of my conception of the reading disability. Such a theory 
conforms nicely to our observations that these cases are not to be 
divided into two categories, that is, cases of word blindness and cases 
of slow acquisition of reading, but that they form a series graded in 
severity according to the degree of confusion which exists in choice of 
engrams and it also offers an explanation of certain errors and peculi- 
arities which characterize their performance. 

The two halves of the body are strictly antitropic, that is, reversed 
or mirrored copies of each other. The muscles and joints of the right 
and left hand, for example, are alike but reversed in arrangement. 
This is also true of the groups of nerve cells in the spinal cord which 
control the simpler motor responses (spinal reflexes) and also of the 
cells in the brain which combine or integrate these simpler spinal units 
into more complex acts. The movements of the left hand, therefore, 
which are the exact counterpart of the right will give a mirrored result. 
Thus, the movements of sinistrad (mirror) writing with the left hand 
are exactly comparable to those of dextrad writing with the right hand 
and it seems therefore highly probable that the engrams which are 
stored in the silent areas of the non-dominant hemisphere are opposite 
in sign, i.e., mirrored copies, of those in the dominant. If then these 
opposite engrams are not elided through establishment of consistent 
selection from one hemisphere we would expect them to evince them- 
selves by errors or confusion in direction and orientation and this is 
exactly what we find in cases of delayed reading. 

This description is really “putting the cart before the horse” as our 
observations of tendency to reversals came first and the theory devel- 
oped therefrom but this method of presentation has been adopted for 
the sake of clarity. Many workers with word blind children have 
noted their tendency to reversals but none, so far as I am aware, have 
offered an adequate explanation of it. 

My original studies in a small group of cases convinced me that 
there were certain “symptoms” in reading disability which seemed to 
characterize the whole group and these were confusions between lower 
case b and d and between p and q; uncertainty in reading short palindromic 
words like was and saw, not and ton, and on and no; a tendency 
to reverse parts of words or whole syllables as when gray is read as gary, 
tarnish as tarshin and tomorrow as tworrom; a greater facility than usual 
in reading from the mirror, and frequently a facility in producing 
mirror writing. These observations have been adequately supported 
in an extended study of a much larger group of cases. Many other 
types of errors are to be found in the performance of retarded readers 
but they appear to me to be secondary effects due to the failure of 
association which has resulted from the obstacle presented by confusion 
in direction. The relation of the cardinal symptoms to the theory as 
above outlined is obvious and I think has direct bearing on the teach- 
ing method. Visual presentation will, hypothetically at least, result 
in the implantation of paired engrams and certain other factors must 
determine which of these is selected for associative linkage. What 
these factors are as a whole, we can not consider here although it may 
be well to suggest that heredity probably plays a part in the establish- 
ment of dominance here comparable to that which it plays in stutter- 
ing and in left-handedness. Undoubtedly training influences may be 
brought to bear on this process of choice, however, and from the 
theoretical standpoint the most promising of these should be that of 
kinesthetic training by tracing or writing while reading and sounding 
and by following the letters with the finger (a method under taboo 
today) to insure consistent direction of reading during phonetic 
synthesis of the word or syllable. 

Under a grant from the Rockefeller Foundation, an extended field 
study was carried out in 1926-27 in Iowa by the organization, as a part 
of the research work of the State Psychopathic Hospital, of a Mobile 
Mental Hygiene Unit to visit schools in various communities and a 
Laboratory Unit to study selected cases more intensively. Fuller 
reports of these studies are to appear elsewhere but certain observa- 
tions may be quoted here. In my original group of reading disability 
cases, I was surprised at the large proportion of these children encoun- 
tered. Fifteen out of one hundred twenty-five children sent by their 
teachers to our experimental field clinic for a variety of problems⁶ 
seemed to me to show evidence of this trouble. In our extended work 
we have found in every community visited no less than two per cent 
of the total school population to be retarded readers showing this 
characteristic picture. Our studies were not carried out as a survey 
and hence these figures probably fall far below the actual numbers. 
There was however a difference in the numbers of cases encountered in 
certain communities which seemed to bear directly on the subjects 
here considered. Of two communities of about the same constituent 
population, in one we found about two per cent of the school popula- 
tion to be retarded in reading to a significant degree and to show 
symptomatic evidence of the specific disability, while in the second we 
found more than double this percentage. In the community with the 
lesser number of cases, sight reading methods were employed but when 
children did not progress by this method they were also given help by 
the phonetic method. In the town with the larger number, no child 
was given any other type of reading training until he or she had 
learned ninety words by sight. 

Aside then from theoretical considerations, this strongly suggests 
that the sight method not only will not eradicate a reading disability 
of this type but may actually produce a number of cases. Moreover, 
our retraining experiments’ seem to indicate clearly that such children 
can be trained to read properly with adequate special methods devised 
to eradicate the confusion in direction and in orientation and this has 
also been borne out by the remedial efforts of other workers. 

Our studies of children with reading disabilities have also brought to 
light certain other aspects of the problem which are of educational 
importance but which can not be elaborated here. Among these were 
notably the effect of this unrecognized disability upon the personality 
and behavior of the child. Many children were referred to our clinics 
by their teachers in the belief that they were feeble-minded, others 
exhibited conduct disorders and undesirable personality reactions 
which upon analysis appeared to be entirely secondary to the reading 
defect and which improved markedly when special training was insti- 
tuted to overcome the reading disability. 

In brief, while “sight reading” may give greater progress when 
measured by the average of a group, it may also prove a serious obstacle 
to educable children who happen to deviate from the average in the 
ease of establishment of a clear cut unilateral brain habit. These 
physiological deviates form a graded group extending in severity from 
the normal to extreme cases (congenital word blindness). They can 
be detected by appropriate examinations and trained to overcome their 
handicap by specific methods of teaching. While the number of 
children who suffer from such a severe grade of the disability as to be 
practically uneducable by ordinary methods is quite small, the number 
in whom the disability exists to a sufficient degree to be a serious handi- 
cap to school performance and to wholesome personality development 
probably is of real numerical importance and moreover there seems to 
be reason to believe that even those who make a spontaneous adjustment 
without special training, and thus learn to read, may never gain 
a facility in this accomplishment commensurate with their ability in 
other lines. 

INTELLIGENCE TESTING 


The Kuhlman-Anderson Intelligence Tests Compared with Seven Others. The 
Journal of Applied Psychology, Dec., 1928, 545-594. Children (N, 1400) in 
Grades I to IX were tested. Tests other than Kuhlman were Pintner-Cunning- 
ham, Otis Group Primary, National Scale A, Otis SA Higher, Terman Group. 
Comparative norms, intercorrelations, and results regarding discriminative capaci- 
ties are contained in the article. 

The Performance of Pre-school Children of Different Social Groups on the Kuhl- 
man-Binet Tests. Florence L. Goodenough and Gertrude Shapiro. Journal of 
Educational Research, Dec., 1928, 356-362. In this study three hundred eighty 
children between eighteen and fifty-four months of age were tested twice with an 
average interval of six weeks between examinations. Group A (professional, 
semi-professional and managerial, clerical and skilled trades) of same mental level 
as Group B (semi-skilled trades, slightly skilled, and unskilled workers) proved 
decidedly superior to Group B in language tests, equivalent to Group B in acquired 
information and adaptive behavior tests, but inferior to Group B in motor tests. 

A Vocabulary Information Test. Angelina L. Weeks. Archives of Psychology, 
No. 97, May, 1928. The writer presents the returns from three vocabulary lists 
of fifty words each that were administered to over thirteen hundred pupils, Grades 
VII to XII. Correlations (N, 70 to 145) between vocabulary test and (a) National 
Intelligence, .41 to .53; and (b) Stanford Achievement, .75; and (c) Otis Classifica- 
tion, .45 to .74; and (d) Haggerty Reading, .58; and (e) Haggerty Vocabulary, .79; 
and (f) Otis SA, .77; and (g) school marks, .38 to .57. 

Correlation between Intelligence Test Scores and Success in Certain Rational 
Organization Problems. K. C. Garrison. The Journal of Applied Psychology, 
Dec., 1928, 621-630. Correlations are presented for these factors: Rational 
learning (analogy) and Otis, .50; and Detroit, .41. 

Power and Speed: Their Influence upon Intelligence Test Scores. Frank S. 
Freeman. The Journal of Applied Psychology, Dec., 1928, 631-635. Utilizing 
the results of Dearborn Group Tests administered to one hundred cases Grades VI 
and VII, the author concludes that this test primarily measures power. Correla- 
tions between single time and double time were from .76 to .91 for the different 
parts. 

Spearman’s Measure of Intelligence: A Statistical Analysis. John L. Hardie. 
The British Journal of Psychology, Oct., 1928, 188-197. Using seven hundred 
sixty-one pupils, the writer checked Spearman’s tests of intelligence against 
measures of English and arithmetic abilities. Also the subtests of the intelligence 
test were checked against the total score. 


MEASUREMENT OF ACHIEVEMENT 


Achievement Curves and Their Trend through the Year. L. N. Garlough. Journal 
of Educational Research, Dec., 1928, 363-368. A class of fifty students (high 
school) in biology, average IQ 113, was administered fourteen objective tests at 
two-week intervals. Skewness, kurtosis, and variation from test to test were 
calculated. 

Retention of History in the Sixth, Seventh, and Eighth Grades with Special 
Reference to the Factors That Influence Retention. Sarah J. Bassett. The Johns 
Hopkins University Studies in Education, No. 12, 1928. The study involved the 
testing of 1364 children over a period of four to sixteen months. “The mean 
amounts of history retained after intervals of four, eight, twelve, and sixteen 
months are eighty-six per cent, eighty-two per cent, seventy-seven per cent, and 
seventy-two per cent, respectively.” Retention was correlated with mental age, 
chronological age, subject preference, interest and effort, reading comprehension, 
ability on a standard American history test, height, and weight. 

The Use of Informational Tests in American History Teaching. E. W. Thorn- 
ton. The Historical Outlook, Jan., 1929, 12-16. An analysis of the topics in 
ten objective tests in history published between 1915 and 1924 revealed the follow- 
ing: (1) seventy-four per cent of the topics cover the period 1607 to 1865, with 
only seventeen per cent since 1865; (2) average per cent of topics in ten texts 
devoted to (a) Military, 16.2, to (b) Social-economic-religious-education, etc., 15, 
and to (c) Political-geographic, 69. 

Report on Survey Test in History of the United States, Grade VIIB. Edwin C. 
Broome and Philip A. Boyer. The Historical Outlook, Jan., 1929, 17-23. This 
test was administered to 10,337 Philadelphia pupils. Norms are included in the 
article. 

Test on World History from Earliest Times through the Reformation. Sue 
Hefley. The Historical Outlook, Jan., 1929, 27-29. A test composed of two 
hundred sixty-four items of these types of subtests: True-false, multiple choice, 
matching. 

Sex Differences in History Ability. James A. Fitzgerald and W. W. Ludeman. 
Peabody Journal of Education, Nov., 1928, 175-181. With a total of two hundred 
eight cases, in fact statements and reasoning statements the boys proved superior 
in both Grades VI and VII but inferior in Grade VIII to the girls. 

Standardized Tests of Ability to Use Correct English. J. E. Bathurst and others. 
Public Personnel Studies, Dec., 1928, 241-250. The test was administered to 
1628 persons in colleges, junior and senior high schools, night and business high 
schools, and a commercial business college. Norms are presented for groups 
separated on basis of education completed. 


PSYCHOLOGY OF LEARNING AND SCHOOL SUBJECTS 


The Use of Tests in the Teaching of the Social Studies. Howard C. Hill. The 
Historical Outlook, Jan., 1929, 7-10. New methods of using objective tests for 
teaching are suggested. 

The Use of Objective Tests in Teaching as Illustrated by Grammar. Estelle L. 
Maloney and G. M. Ruch. The School Review, Jan., 1929, 62-66. A total of 
four hundred ninety-seven pupils in Grades IX, X, and XI served as cases. 
“The (objective test drill) method proved superior to the other two methods in all 
grades as judged by the final tests. The combination method was second in every 
case, and the textbook method was last.” 

An Experiment in Method for Junior Standards. Eve Macauly. The Forum 
of Education, Nov., 1928, 217-240. The individual method proved superior to 
the group with eight year old children (N, 36 each group) over a period of one 
year. 

The Cost of Quick Shifting in Number Learning. Garry C. Myers and Caroline 
C. Myers. Educational Research Bulletin, Oct. 31, 1928, 327-334. An investiga- 
tion of one hundred sixty four cases in Grades V and VI, and in a normal school 
to determine the comparative rates and errors resulting from a drill in mixed 
processes of arithmetic and in a drill in grouped processes. 

An Experiment with Remedial Work in Common Fractions. Elma A. Neal and 
Inez Foster. The Elementary School Journal, Dec., 1928, 280-283. A device 
for remedial work is suggested. This device proved effective over a period of 
three months with large groups of children. 

A Supervision Project in Multiplication. Paul B. Clemens and Paul F. Neu- 
bauer. Journal of Educational Research, Dec., 1928, 387-396. The errors 
occurring in objective tests (two hundred ten examples) given to 2000 pupils, 
Grades IVB to VIIIA, are analyzed and presented in table form. 

A Study of One Factor in the Grade Placement of Reading Materials. Prudence 
Cutright, George P. Halvorson, and L. J. Brueckner. The Elementary School 
Journal, Dec., 1928, 284-295. A vocabulary test and a paragraph reading 
comprehension test, both based upon the book “Heidi,” were formulated and 
administered to two hundred eighty-two pupils, Grades IV, V, and VI. 

An Experiment in Remedial Reading Exercises at the College Level. H. H. 
Remmers and J. M. Stalnaker. School and Society, Dec. 22, 1928, 797-800. 
Systematic distributed practice in reading netted significant gains in speed and 
comprehension with seven students over a period of three hours practice time. 

Vocabulary versus Content in Junior High School Social Studies. Tyler Kepner. 
The Historical Outlook, Jan., 1929, 30-33. Seventeen social studies teachers 
recorded all words that approximately fifty per cent of the pupils in their classes 
did not comprehend. (N, 1200; Grades VII, VIII, and IX.) These words were 
compared with other vocabulary studies. 

A Study of General Science Textbooks. Ailsie M. Heineman. General Science 
Quarterly, Nov., 1928, 11-23. An objective study of the principles and applica- 
tions in twenty textbooks published 1915 to 1926. 

A Tentative Vocabulary for First Year French Students. Florence M. Baker. 
Journal of Educational Research, Dec., 1928, 369-377. Combining the Henmon 
French Word List, the Thorndike Word List, and an analysis of compositions in 
English and French, the writer constructed a vocabulary of four hundred words. 

An Experiment in First Year French. Charles E. Young and Josephine M. 
Daus. The Modern Language Journal, Feb., 1928, 356-364. With college 
students (one hundred fifty-two, experimental group; one hundred forty-two, 
control group) systematic techniques for developing grammar, vocabulary, idioms, 
pronunciation, reading, ear training proved much more effective than customary 
methods of teaching over a period of one year. 

An Indicated Effect of Oral Practice. Arthur G. Bovee. The Modern Language 
Journal, Dec., 1928, 178-182. Using the French adaptation of the Thorndike- 
McCall Reading Scale on thirty to forty-five pupils, the author concludes that 
much reading practice is necessary, some of which should be oral. 

Grammarless Reading of Foreign Languages. James Burton Tharp and Eloise 
Murray. The Modern Language Journal, Feb., 1928, 385-390. With college 
freshmen in French, the “reading sections” gained significantly more in vocabulary 
and reading but much less in grammar than did the “regular sections.” 


CHILD PSYCHOLOGY 


Measuring Behavior Traits by Means of Repeated Short Samples. Florence 
L. Goodenough. The Journal of Juvenile Research, Sept.-Dec., 1928, 230-235. 
“This consists simply in the observation of the everyday behavior of an individual 
or a group of individuals for definite short periods of time and the recording of the 
occurrence or non-occurrence of certain specified and objectively defined forms of 
behavior during each of these periods.” The method revealed reliability 
coefficients ranging from .324 to .871, average .628, for the several traits rated. 

Some Character and Personality Problems of Remedial Cases in Reading. Laura 
Zirbes. Childhood Education, Dec., 1928, 171-176. A report of seven case 
studies. 


EDUCATIONAL PSYCHOLOGY 


A Note on the Beta Hypothesis of Learning. James Quinter Holsopple and Irene 
Vanouse. School and Society, Jan. 5, 1929, 15-16. With eleven students of 
typewriting using two methods for correcting errors in spelling, the method in 
which the students voluntarily practised the word as habitually incorrectly written 
proved decidedly superior to the method in which the students practised the word 
correctly written. 

Do Groups Think More Effectively than Individuals? Goodwin B. Watson. 
The Journal of Abnormal and Social Psychology, Oct.-Dec., 1928, 328-336. 
The cases, one hundred eight graduate students in education, were practised in 
four ten-minute periods in word-building, two periods as individuals and two as 
groups. In number of words, group thinking was distinctly superior to that of the 
average individual and even to that of the best member of the group; individual 
plus group was superior to either method separately. The larger the group the 
more superior the group product. The group product was largely dependent 
upon the best individuals in that group. 

Modes of Emphasis in Public Speaking. Arthur Jersild. The Journal of 
Applied Psychology, Dec., 1928, 611-620. Lectures submitted to two hundred 
fifty-three college students who were tested tended to show that (1) three or more 
distributed repetitions of statements or ideas, (2) the opening statements, and (3) 
verbal comments directing attention to an item were most effective for inducing 
recall. 

Influence of the Study of Modern Foreign Languages on the Development of 
Abilities in English. Oscar H. Werner. The Modern Language Journal, Jan., 
1928, 241-260. Cases—high school: control, 286, experimental (foreign language 
group), 392; college freshmen: control, 69; experimental, 69. Subjects—French, 
Spanish, German. Tests—Pressey Punctuation, Pressey Sentence Structure, 
Charters Diagnostic Language and Grammar, Purdue English, Monroe Silent 
Reading, Thorndike-McCall Reading. Time elapsed between two testings—one 
school year. The writer concludes that (1) the study of modern foreign languages 
clearly aids the development of speed and comprehension in reading, especially 
with high school pupils; (2) it aids the development of ability in grammar and in 
vocabulary; (3) it is doubtful whether the study aids in punctuation, sentence 
structure, and discovery of speech errors; and (4) the higher the IQ, the more 
likely modern language study is to effect an improvement in desirable English 
abilities. 

Speed and Accuracy as Factors in Objective Tests in General Psychology. Howard 
P. Longstaff and James P. Porter. The Journal of Applied Psychology, Dec., 
1928, 636-642. Fourteen tests, multiple answer type, in general psychology were 
administered to one hundred eighty-six university students. Average correla- 
tions: (1) inter-accuracy for (a) lecture tests, .45, for (b) laboratory tests, .41; (2) 
inter-speed for (a) lecture tests, .51, for (b) laboratory, .29; (3) speed and accuracy, 
.07. 


CHARACTER AND PERSONALITY TRAITS 


The Predictive Value of Certain Tests of Emotional Stability as Applied to College 
Freshmen. Edwin G. Flemming. Archives of Psychology, No. 96, 1928. A 
study involving three hundred forty-one Columbia College freshmen measured by 
Woodworth’s Personal Data Sheet, Laird’s Personal Inventory, the Pressey Cross- 
Out Test, and the Thorndike Intelligence Examination. These data exhibit 
slight negative and slight positive associations with items of the personnel records 
regarding studiousness, social activity, motor activity, sleep, work, and illness. 

Differences between College Students and Their Elders in Standards of Conduct. 
Alice Anderson and Beatrice Dvorak. The Journal of Abnormal and Social 
Psychology, Oct.-Dec., 1928, 286-292. The questionnaire (multiple-choice) 
submitted to one hundred sixty-two adults in five groups showed that (1) youth, 
differing from elders, prefers standards of prudence and aesthetics to that of right 
and wrong, and (2) differences are larger between age than between sex groups. 

Group Tests of Psychopathic Tendencies in Children. Anna S. Elonen and 
Herbert Woodrow. The Journal of Abnormal and Social Psychology, Oct.—Dec., 
1928, 315-327. A total of one hundred sixty-two subjects, ages 9-11 to 15-5, 
Grade VI, were tested with the National A, psycho-neurotic questionnaire, and 
a free association test, and were rated by their teachers and principals. Correla- 
tions: (1) Pathological association scores and (a) amalgamated ratings, .57, (b) 
intelligence score, —.28, and (c) IQ, —.31; (2) Woodworth-Cady scores and (a) 
ratings, .25, and (b) pathological scores, .20, and (c) intelligence, .04; (3) ratings 
and intelligence with pathological association scores, .73. 

Sex Differences in Credulity. Harvey C. Lehman and Paul A. Witty. The 
Journal of Abnormal and Social Psychology, Oct.-Dec., 1928, 356-367. A 
significantly larger per cent of girls than boys, ages 8½ to 22, participated in “Telling 
fortunes or having fortunes told,” the discrepancy in per cents between sexes 
increasing with age. 


EDUCATIONAL AND VOCATIONAL GUIDANCE 


Why “Honor” Engineering Students Think They Succeed in College. M. E. 
Herriott. School and Society, Dec. 29, 1928, 829-830. By group and individual 
conferences with forty-eight honor students, two types of outstanding reasons were 
mentioned: Study habits and attitudes toward school work. 

A Project in Student Personnel Service at the College Level. Lonzo Jones. School 
and Society, Dec. 15, 1928, 765-768. The writer finds that college achievement 
can be raised by assigning students achievement standards based on a battery of 
tests (mathematics and English—training and aptitude, reading comprehension, 
high school content, and psychological). Methods for promoting achievement 
according to ability are suggested. 

How Do Children Spend Their Time? Bronett Goldberg and Luella C. Pressey. 
The Elementary School Journal, Dec., 1928, 273-276. The returns from a ques- 
tionnaire submitted to three hundred nine children in Grades IV, V, and VI indi- 
cated that about ten hours per day were spent in sleeping; seven hours, in school; 
and three hours, in amusements, etc. 

The Social and Educational Status of the Pupils in a Residential Suburban 
Community. J. F. Towell. Jan., 1929, 49-58. From a question blank regarding 
father’s education, occupational status, membership in clubs, the returns are 
compiled for the community. 

Baltimore Industry and Elizabeth Bent Are Helped by These Personnel Cards. 
Leora C. Buchwald. Industrial Psychology, Nov.—Dec., 1928, 477-487. The 
Baltimore Record Cards are described and presented in fac-simile. 

Ability Classification in Ninth Grade Algebra. L. E. Mensenkamp. The 
Mathematics Teacher, Jan., 1929, 38-48. With four hundred six cases the follow- 
ing correlations were secured: First semester algebra mark and second semester 
Grade VIII arithmetic mark, .54; algebra mark and Otis IQ, .42; arithmetic mark 
and IQ, .35. 


MISCELLANEOUS 


The Analysis and Evaluation of College and University Courses in Education. 
W. E. Peik. Journal of Educational Research, Dec., 1928, 345-355. The article 
presents ingenious methods (involving analysis of content and methods of courses, 
ratings by alumni of these, etc.) by which the university revised its curriculum in 
education. 

Can Students Discriminate Traits Associated with Success in Teaching? J. M. 
Stalnaker and H. H. Remmers. The Journal of Applied Psychology, Dec., 1928, 
602-610. With the Purdue Rating Scale for Instructors these results were 
obtained: (1) the correlation showing student agreement in the importance of 
traits was .92 to .95; (2) students showed little halo effect in discriminating traits 
for rating purposes. 

An Interpretation of the Heredity Background of Two Groups of Mental Deviates. 
Paul A. Witty and Harvey C. Lehman. The American Journal of Sociology, 
Sept., 1928, 316-329. Facts regarding racial origin, physical heredity, and mental 
heredity for two groups—one with IQ 140 or above (N, 50), the other with IQ 70 
or below (N, 50)—were collected. Striking differences favoring the brighter group 
were found. 

Ten Years of Educational Research, 1918-1927. Walter S. Monroe and others. 
Bureau of Educational Research, College of Education, University of Illinois, 
Bulletin No. 42, Aug. 21, 1928. 

Dentition as a Measure of Maturity. Psyche Cattell. Harvard Monographs 
in Education, 1928. Dentition norms are presented for various chronological 
ages, races, sexes, and intelligence levels. 


CHARACTER EDUCATION 


Studies in Deceit, by Hugh Hartshorne and Mark A. May. New 
York: Macmillan, 1928. Book I, pp. XXI + 414. Book II, 
pp. VIII + 306. 


This is unquestionably the most important research and probably 
the most important book which has so far appeared in the field of 
character education. Its significance arises in part from its content 
and in part from its wealth of new procedures pointing toward further 
scientific attacks on the nature of character. 

Cheating, stealing, and lying behaviors are measured in school, 
home, party, and athletic contest situations. The battery of decep- 
tion tests contains twenty-nine opportunities to cheat. Two types of 
lying are measured. Two opportunities to steal money and one to 
steal small articles are employed. In the deception tests the essential 
technique is to measure or estimate achievement under honest condi- 
tions and then to measure achievement under conditions permitting 
dishonesty. The degree to which achievement under dishonest condi- 
tions exceeds probable or honest achievement measures the amount of 
deception. A “fact” cheating score is also derived for each deception 
test. This is defined as achievement under dishonest conditions which 
would occur under honest conditions only once in a thousand times. 
The essential technique involved in the stealing tests is to give the 
child an opportunity to take money or small articles under such condi- 
tions that there is no apparent possibility of his being caught and 
under such conditions that the experimenters can objectively determine 
those who did steal. The reliabilities of the deception tests vary 
from .24 to .87, the median reliability being .75. The special types of 
deception studied, especially school room deception, have been meas- 
ured with a very high degree of validity. But the authors conclude 
that to measure all types of deception would require a more extensive 
battery. 
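
As a rough illustration of the scoring logic just described, the sketch below takes the amount of deception to be the excess of the score earned under conditions permitting dishonesty over the honest score, and flags a “fact” cheating score when the dishonest score is one that would arise under honest conditions only about once in a thousand times. The function names, the sample figures, and above all the assumption that honest scores are normally distributed are illustrative choices made here; the review does not say how the authors located the one-in-a-thousand point.

```python
from statistics import NormalDist

# "Once in a thousand times" under honest conditions, as stated in the review.
ONE_IN_A_THOUSAND = 0.001


def deception_amount(dishonest_score: float, honest_score: float) -> float:
    """Excess of achievement under dishonest conditions over honest achievement.

    A value of zero or less means the score shows no excess to attribute
    to deception.
    """
    return dishonest_score - honest_score


def fact_cheating_threshold(honest_mean: float, honest_sd: float,
                            tail: float = ONE_IN_A_THOUSAND) -> float:
    """Score exceeded under honest conditions only `tail` of the time.

    ASSUMPTION: honest scores are modelled as normal with the given mean
    and standard deviation; this model is not taken from the source.
    """
    return NormalDist(honest_mean, honest_sd).inv_cdf(1.0 - tail)


def is_fact_cheater(dishonest_score: float, honest_mean: float,
                    honest_sd: float) -> bool:
    """True when the score lies beyond the one-in-a-thousand point."""
    return dishonest_score > fact_cheating_threshold(honest_mean, honest_sd)


if __name__ == "__main__":
    # Hypothetical figures: honest scores average 20 with a spread of 4.
    print(deception_amount(35, 28))
    print(is_fact_cheater(35, 20, 4))
    print(is_fact_cheater(30, 20, 4))
```

Under the normal model the one-in-a-thousand point lies a little more than three standard deviations above the honest mean, so only quite large gains are counted as cheating in “fact,” while the simple difference score still registers smaller excesses.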

The major centers of interest are the factors associated with decep- 
tion, the evaluation of certain methods of character education, and the 
nature of deceptive behavior. The chief factors associated with 
deception are low intelligence, poor home environment, and suggesti- 
bility. Friends in the same classroom and siblings show considerable 
resemblance in deception. Retardation, emotional instability, poor 
deportment, low scholastic standing, and excessive movie attendance 
are significantly related to deception. There are significant differences 
between racial and religious groups. Age, sex, school grade, and 
physical condition are essentially uncorrelated with deception. 

On the whole the evaluation of current methods of character 
education raises serious questions of their worth. One elaborately 
worked out method of character education actually proved to make for 
more dishonesty among those children who attained the higher ranks 
in its organization. A well established and nation wide organization 
for developing character among boys failed to show positive results. 
But progressive schools showed less deception than conventionally 
organized schools. 

The tendency to deceive is specific. If children are tested in the 
same situation with the same type of materials and the same oppor- 
tunity to deceive, the intercorrelations of test scores are high. But if 
the material is slightly changed or if children have the opportunity of 
copying from a key instead of adding on scores, intercorrelations 
between tests fall markedly. If in addition to these changes, a school 
room situation is compared with an out of school situation, the inter- 
correlations of tests of deception fall still more. To an overwhelming 
degree the total situation rather than some trait of honesty or dis- 
honesty determines whether the child will or will not deceive. 

Both the practical worker and the research student will find this 
book significant. The reviewer has had the opportunity of using it 
with three groups of undergraduate students at the University of Iowa 
in a course in methods of character education. In a new field where 
insight must be largely gained by uncertain inferences from the wisdom 
of other fields, students responded enthusiastically to the wealth of 
concrete detail dealing directly with the most vital aspect of education. 
With two small groups of graduate students trained in statistics and 
mental tests the reviewer has used the book in a course in the measure- 
ment of character and personality and has found the same enthusiastic 
response. The students in the course on methods read essentially all 
of Book I with the question “What does this mean for a practical 
program?” The students in the course on measurements read parts 
of Books I and II ignoring entirely inferences for a practical program 
and concentrating on an evaluation of tests, statistical techniques, and 
scientific methods. That two different classes with different training 
and specialized interests profited by study of this book is a significant 
tribute to its organization, its clarity of exposition, and its content. 

Only those students who have labored mightily (and for the most 
part they have labored without success) at the problem of measuring 
character are in a position to assay the true accomplishment of this 
book. Progressively for the past eight years they have flooded the 
literature with a hundred and one suggested measures and with nearly 
as many reports of additional data on the Downey will-temperament 
tests. The reviewer does not wish to belittle the literature on the 
measurement of character and personality. On the contrary he holds 
some of it in high regard. But as a student who is interested in the 
scientific groundings of a practical program of character education 
this literature was mostly confusing and discouraging in its prospects 
for the future. With this discouragement as a background, the almost 
bewildering variety of tests of deception inspires new confidence that 
large areas of the problem may yet be subjected to scientific scrutiny. 

FRANK K. SHUTTLEWORTH. 


University of Iowa. 





PROBLEM IN BEHAVIORISM 


Human Behavior, by Walter S. Hunter. Chicago: University of 
Chicago Press, 1928. Pp. X + 355. 


The present volume is a revision of the author’s earlier volume, 
“General Psychology,” in the sense that the organization and much 
of the material is the same. It is hardly a revision in another sense, 
and Hunter himself does not seem to consider it such, since the point 
of view has been changed. 

“Human Behavior” is an attempt to state the problems of human 
nature in behavioristic terms. The author says in his preface: “The 
more I have studied and reflected upon problems of human nature, 
the more I have become convinced that the approach to these problems 
in terms of consciousness is invalid.” The restatement of essentially 
the same evidence in behavioristic terms by the same author offers 
interesting possibilities and the comparison of the two volumes should 
be valuable source material. It might be a stimulating experiment to 
take the chapter in each volume on “Thinking” and see which one 
proves to be more effective in helping students think more effectively 
(about thinking) which is, I suppose, the purpose of a chapter on 
thinking. 

Most instructors will find the discussion of topics too brief to make 
the book important for source material. The volume does have useful 
experimental evidence, chiefly in the field of animal behavior. The 
book is interestingly written and the increase in page size for the same 
amount of print is to be commended. 

RALPH B. SPENCE. 

Teachers College. 





THE LEARNING OF CONCRETE AND ABSTRACT NUMBERS 


The Development of Children’s Number Ideas in the Primary Grades, 
by William A. Brownell. Supplementary Educational Mono- 
graphs, No. 35. Chicago: University of Chicago, August 1928. 
Pp. XIII + 241. 


As the present methods of arithmetic instruction have been assailed 
as unsatisfactory, this monograph is an attempt to probe a little more 
deeply into the learning of concrete and of abstract number, and upon 
this knowledge to base suggestions as to more advisable teaching 
techniques therefor. 

Altogether 1858 pupils distributed in Grades IB-VA, VIA, and VIIA 
were used in this study. Exposure cards revealed the numbers 3-12 
arranged in quadratic, diamond, domino, triangular, odd and linear 
groupings of dots. Pupil responses to these number pictures were 
estimates of the number expressed as written digits, or the drawing of 
these groups separately. 

In the development of ability to apprehend these visual concrete 
numbers, the author found that: (1) pupil difficulties were directly 
proportional to the number of objects exposed, provided the factor of 
grouping be eliminated; (2) although amounts of intelligence and 
of school instruction must be considered, the most adequate index of 
development is the maturity of one’s process of apprehension as 
measured by this arbitrarily defined scale of increasing amounts 
(counting, partial counting, grouping, multiplication and conversion); 
and (3) efficient apprehension of concrete number is prerequisite to the 
understanding of abstract number upon which is based the learning of 
additive combinations which, in turn, facilitates the making of second 
additions in three digit column addition. 

The reviewer is critical of these assertions made by the author: 
(1) Statistical procedures are inaccurate, based upon the use of certain 
direct and partial correlations only; (2) group tests cannot reveal the 
processes of thinking responsible for the product; (3) proponents of 
bond analysis do not encourage a pupil to make use of facts already 
learned in acquiring new facts although they do encourage mere 
establishment of bonds, which means that method of performance may 
be safely disregarded; (4) drill is an instructional technique and not a 
means to reach “meaningful habituation”; and (5) the tendency to
generalize cannot be the product of specific teaching. 

Although in this case study the tendency of the author was to 
generalize from the learning processes of one and two individuals and 
then to check these generalizations by a further, single instance, these 
analyses of the processes of thinking within each level of arithmetic 
learning should result in wholesome changes in certain teaching devices 
during instruction in primary arithmetic leading toward more meaning- 
ful habituation of the processes of thinking involved in concrete and in 
abstract number. Winona M. Perry. 

University of Nebraska. 





A PSYCHOLOGICAL DICTIONARY


Our Minds and Our Motives, by Paul D. Hugon. New York: G. P.
Putnam’s Sons, 1928. Pp. VII + 475. 


“The object of this book is to present in one volume, arranged for 
instant reference, in language as simple as is consistent with accuracy, 
the facts and theories of human behavior, together with such notions 
of philosophy as may be necessary for their understanding.” 

Mr. Hugon is more ambitious than the most learned of our sages. 
We wonder whether it is an ambition born of ignorance or whether 
Mr. Hugon is having a little joke. 

The book is written in the form of an encyclopedic dictionary.
It is a conglomeration of mixed fact and fancy. For example, his
discussion of ability, general and specific, is essentially reasonable and
sound, as is his discussion of habit, as far as it is known, etc., but his
discussion of business ability, of which we read, “There is no better
means of ascertaining rapidly and surely the ability of a business man
than the study of his hand writing,” and his discussion of intelligence,
of which we read, “Intelligence can be determined with 95-100 per
cent accuracy from hand writing,” are rather amusing. These are just
examples of the material in the book. C. S. Slocombe.
Boston Elevated Railway. 





AN OLD PROBLEM ATTACKED


The Problem of Stuttering, by John Madison Fletcher. New York: 
Longmans, Green and Co., 1928. Pp. XIV + 362. 


In every hundred children one child is afflicted with stuttering 
speech. The efforts of educators to meet his problem up to this time 
have not met with an appreciable success and the majority of these 
children continue under their handicap throughout life. Although the 
phenomenon of stuttering has long been known to science most of the 
literature concerning it is in the nature of unsubstantiated opinion 
from which various methods of treatment, “inadequate and in important
particulars unscientific” have developed. This volume, according
to its author, “represents an attempt to offer certain explanations of
the apparent failure of present methods, together with a new, and it is 
hoped, a more psychologically sound method of approach to this very 
old problem.” 

Dr. Fletcher reviews in a very readable fashion the most significant 
contributions dealing with the nature of stuttering and its remedial
treatment. In comparing the opinions of physicians, psychologists, 
psychiatrists and educators upon the problems involved in determining 
techniques adequate for coping with the malady, he calls attention to 
inconsistencies and fallacies in their attempts to arrive at causal 
factors. Probably the most valuable service of the volume lies in the 
assembling of statistical data hitherto inaccessible to investigators to 
whom extensive library facilities were not available. A suggested 
terminology for differentiating the various types of speech difficulties 
has won almost immediately the approval of a group of specialists. 

“The new and more psychologically sound” method for the cure
of stuttering, promised by the author in his preface, gives the practical 
clinician little he can use in effecting permanent cures. The method 
advocated by Dr. Fletcher is subject to the same criticism he gives the 
techniques used by psychiatrists, teachers of elocution, and physicians. 
It is directed toward no clearly defined ends nor toward the eradication 
of factors causing the disturbance, but seeks to build up a definite set 
of speech skills by such vague and generalized methods as “environment
therapy,” “social adjustment” and “educational prophylaxis.”
He believes voice and speech exercises are ‘‘to be condemned,” but that 
specially trained and equipped teachers should have charge of the 
stutterer under special environmental conditions adapted to his needs. 
How such teachers should be trained and equipped and what are ideal 
environmental conditions he does not say. 

Nevertheless his volume is a real contribution to the advancement 
of a more scientific means of attacking this perplexing problem, for he 
has given the investigator a careful evaluation of the literature concern- 
ing efforts to determine the nature of stuttering and its cure as well as 
an impetus to undertake more careful and extensive studies than those 
now available. Elizabeth D. McDowell.

Teachers College, Columbia University. 





Measurable Outcomes of Individual Laboratory Work in High School 
Chemistry, by Ralph E. Horton. New York: Bureau of Publica- 
tions, Teachers College, Columbia University, 1928. Pp. VIII 
+ 105. 


The chemical laboratory in the high school today is expensive 
equipment. Should it be dispensed with? The teacher demonstra- 
tion method in use in Europe has received much attention. Results of 
attempts to compare it with the individual laboratory practice in the 
United States were contradictory. Faced with such a dilemma, Dr. 
Horton sought more evidence for a working plan for the teaching of 
chemistry in the school in which he was employed. Specifically his 
problem was: What are some measurable outcomes of individual 
laboratory work in a course in high school chemistry? 

The problem is not a simple one. It is exceedingly complex and 
the research is worthwhile for having found answers to so many ques- 
tions. It is not the province of this report to detail the investigation, 
even if space permitted. Those who will appreciate the findings most 
will desire to read the dissertation for themselves. The major findings 
deserve to be set forth, nevertheless. From this investigation labora- 
tory work is seen to have value in itself. It appeared to contribute 
little to passing written tests in chemistry. Yet, individual laboratory 
work is a stimulus to interest in chemistry, practically nine out of ten 
pupils preferring it to the teacher demonstration. Such an interest is 
worth knowing. Further investigation helped to set up a list of 
fifty-five laboratory manipulations ranked for importance by experts
in the field. Such a list is invaluable to curriculum makers. Dr.
Horton found evidences of over-emphasis in individual laboratory 
work and was able to attain desired results in fewer periods while 
improving laboratory work at the same time. Experimental evidence 
was found for the belief that practice in the laboratory develops better 
skill to manipulate materials than does the demonstration method. 
So long as colleges require certification of such skills (which is undoubt- 
edly desirable for those students who undertake advanced work in 
chemistry) individual laboratory work is desirable. Furthermore, 
this research indicates a method of setting up a test of basic manipula- 
tive skills. 

Two aspects of this research are noteworthy. The first relates to 
“the proposal . . . to test experimentally the possibilities of formal 
instruction in scientific thinking at the high school level in chemistry 
classes.” The outcome was a technique in which pupils successfully
devised their own experiments. It should be stimulating reading for 
many science teachers. 

The second aspect is concerned with the method of the research. 
Preliminary work was undertaken and proved inconclusive. However,
certain implications were evident and were stated. Then followed a 
plan to validate these implications. Most of the later evidence is 
conclusive and the statement of results is in keeping with this evidence. 
Such validation is no mean achievement at a time when it is not 
unusual to find that the chief result of doctoral research is little more
than the formulation of more specific problems worthy of investigation. 

J. H. Coleman.
Graduate Student, Teachers College. 